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Abstract: We impose various oracle mechanisms on nondeterministic pushdown automata, which natu- 
rally induce nondeterministic reducibilities among formal languages in a theory of context-free languages. 
In particular, we examine a notion of nondeterministic many-one CFL reducibility and conduct a ground 
work to formulate a coherent framework for further expositions. Two more powerful reducibilities — 
bounded truth-table and Turing CFL-reducibilities — are also discussed in comparison. The Turing CFL- 
reducibility, in particular, makes it possible to induce a useful hierarchy built over the family CFL of 
context-free languages. Basic structural properties are proven for each level of this CFL hierarchy. The 
first and second levels of the hierarchy are proven to be different. The rest of the hierarchy (more 
strongly, the Boolean hierarchy built over each level of the CFL hierarchy) is also infinite unless the 
polynomial hierarchy over NP collapses. This follows from a characterization of the Boolean hierarchy 
. . . over the kth level of the polynomial hierarchy in terms of the Boolean hierarchy over the k + 1st level of 

' the CFL hierarchy. Similarly, the complexity class is related to the feth level of the CFL hierarchy. 

We argue that the CFL hierarchy coincides with a hierarchy over CFL built by application of many-one 
■ CFL-reductions. We show that BPCFL — a bounded-error probabilistic version of CFL — is not included 

in CFL even in the presence of advice. Moreover, we exhibit a relativized world where BPCFL is not 
, located within the second level of the CFL hierarchy. 
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[T| ■ 1 Backgrounds and Main Themes 

^ . A fundamental notion of reducibility has long played an essential role in the development of a theory of 

NP-completeness. In the 1970s, various forms of polynomial-time reducibility emerged mostly based on a 
model of oracle Turing machine and they gave a means to study relativizations of associated families of 
^ I languages. Most typical reducibilities in use today in computational complexity theory include many-one, 

— . truth-table, and Turing reducibilities obtained by imposing appropriate restrictions on the functionality of 

oracle mechanism of underlying Turing machines. Away from standard complexity-theoretical subjects, we 
'. will shift our attention to a theory of formal languages and automata. Within this theory, we wish to lay 

out a framework for a future extensive study on structural complexity issues by providing a solid foundation 
. for various notions of reducibility and their associated relativizations. 

Of many languages, we are particularly interested in context-free languages, which are characterized by 
context-free grammars or one-way nondeterministic pushdown automata (or npda's, hereafter). The context- 
free languages are inherently nondeterministic. In light of the fact that the notion of nondeterminism appears 
naturally in real life, it has become a key to many fields of computer science. The family CFL of context- 
free languages has proven to be a fascinating subject, simply because every language in CFL behaves quite 
$H ■ differently from the corresponding nondeterministic polynomial-time class NP. For instance, whereas NP is 

closed under any Boolean operations except for complementation, CFL is not even closed under intersection. 
This non-closure property is caused by the lack of flexibility in the use of memory storage by an underlying 
model of npda. On the contrary, a restricted use of memory helps us prove a separation between the first 
and the second levels of the Boolean hierarchy {CFLfc | A: > 1} built over CFL, which was defined in 
by applying Boolean operations (intersection and union) alternatingly to CFL. Moreover, we can prove that 
a family of languages CFL(A:) composed of intersections of k context-free languages truly forms an infinite 
hierarchy [12' . Such an architectural restriction sometimes becomes a crucial issue in certain applications of 
pushdown automata. For instance, a one-way probabilistic pushdown automaton (or ppda) with bounded- 
error probability in general cannot amplify its success probability [5]. 

A most simple type of reduction is probably many-one reduction and, by adopting the existing formulation 
of this reducibility, we intend to bring a notion of nondeterministic many-one reducibility into context-free 
languages under the name of many- one CFL-reducibility. We write CFL^ to denote the family of languages 
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that are many-one CFL-reducible to a given oracle A. Notice that Reinhardt [13] earher considered many- 
one reductions that are induced by nondeterministic finite automata (or nfa's), which use no memory space. 
Our goal is to build a hierarchy of language families over CFL using our reducibility by way of immediate 
analogy with constructing the polynom,ial(-time) hierarchy over NP [15l[T6]. For this purpose, we choose 
npda's rather than nfa's. Owing mostly to the unique architecture of npda's, our reducibility exhibits quite 
distinctive features; for instance, this reducibility in general does not admit a transitivity property. (For 
this reason, our reducibility might have been called a "quasi-reducibility" if the transitive property is a 
prerequisite for a reducibility notion.) As a consequence, the family CFL is not closed under the many- 
one CFL- reducibility (that is, CFL^^'" ^ CFL). This non-closure property allures us to study the family 
CFL^I^ whose elements are obtained by the A:-fold application of many-one CFL-reductions to languages in 
CFL. As shown in Section IXTl the language family CFL^^^ turns out to coincide with CFL^^^^'^^ 

We further discuss two more powerful reducibilities in use — bounded truth-table and Turing CFL- 
reducibilities based on npda's. In particular, the Turing CFL-reducibility introduces a hierarchy analogous 
to the polynomial hierarchy: the hierarchy {A^^'^, S^^'^, 11^^'^ | ^ > 0} built over CFL, which we succinctly 
call the CFL hierarchy, and this hierarchy turns out to be quite useful in classifying the computational 
complexity of a certain group of languages. As a quick example, the languages Dup2 = {xx \ x S {0, 1}*} 
and Dup^ — {xxx \ x € {0, 1}*}, which are known to be outside of CFL, fall into the second level £2^^ 
of the CFL hierarchy. A simple matching language Match = {x^w \ 3u,v [w — uxv] } is in 5^2^^. Two 
more languages Sq — {0"1" | ?^ > 1} and Prim = {0" | n is a prime number } belong to 'S^^^ and Hg^'", 
respectively. A slightly more complex language MulPrim = {0™" | m and n are prime numbers } is also in 
jiCFL^ The first and second levels of the CFL hierarchy are proven to be different; more strongly, we can 
prove that Ti2^^ ^ ^f^'^/"-: where Ef^'^/n is a non-uniform version of Sf^'", defined in J/Tj and further 
explored in [22]. Regarding the aforementioned language families CFL(fc) and CFL^, we can show later that 
the families CFL(w) = U^>^ CFL(fc) and BHCFL = |Jfc>i CFLfc belong to the second level q of 

the CFL hierarchy, from a fact that CFL(a;) C BHCFL. Despite obvious similarities between their defini- 
tions, the CFL hierarchy and the polynomial hierarchy are quite different in nature. In Section [4. 11 we show 
that CFLJ^^'"''^-' is located within T,^^^ ■ Because of npda's architectural restrictions, "standard" techniques 
of simulating a two-way Turing machine, in general, do not apply; hence, we need to develop new simulation 
techniques for npda's. 

In this paper, we employ three simulation techniques to obtain some of the aforementioned results. The 
first technique is of guessing and verifying a stack history to eliminate a use of stack, where a stack history 
means a series of consecutive stack operations made by an underlying npda. The second technique is applied 
to the case of simulating two or more tape heads by a single tape head. To adjust the different head speeds, 
we intentionally insert extra dummy symbols to generate a single query word so that an oracle can eliminate 
them when it accesses the query word. The last technique is to generate a string that encodes a computation 
path generated by a nondeterministic machine. All the techniques are explained in details in Sections 13. If - 
13.21 Those simulation techniques actually make it possible to obtain three alternative characterizations of 
the CFL hierarchy in Section 14.21 

Reinhardt p4l related the aforementioned hierarchy of his to another hierarchy defined by alternating 
pushdown automata and he gave a characterization of the polynomial hierarchy by this alternating hierarchy 
using logarithmic-space (or log-space) many-one reductions. Using an argument similar to his, we can 
establish in Section [S] an exact characterization of the eth level of the Boolean hierarchy over the fcth level 
E^ of the polynomial hierarchy in terms of the corresponding eth level of the Boolean hierarchy over the 
k + 1st level of the CFL hierarchy. Moreover, we give a new characterization of Q\ (i.e., Wagner's [19] 
notation for Pt(E^_-^ [0(log n)])) in terms of the fcth level of the CFL hierarchy using log-space truth-table 
reductions. As an immediate consequence, all levels of the Boolean hierarchy over each level of the CFL 
hierarchy are different unless the polynomial hierarchy collapses. 

Another relevant notion induced by reducibility is a relativization of language families. For issues not 
settled by the current knowledge of us, we often resort to a relativization, which helps us discuss the existence 
of various relativized worlds in which a certain relationship among target language families either holds or 
fails. For instance, we can construct in Section l¥^ a recursive oracle for which the family BPCFL of languages 
recognized by bounded-error one-way ppda's is not included within the second level of the CFL hierarchy. (Of 
course, there also exists an obvious oracle that makes this inclusion hold.) This separation result contrasts 
a well-known fact that BPP is included in n IIj in any relativized world. To deal with oracle-dependent 
languages in a relativized CFL hierarchy, we utilize its characterization by bounded-depth Boolean circuits 
of alternating ORs and ANDs. Our proof relies on a special form of the well-known switching lemma [T|, 
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Figure 1: Hasse diagram of inclusion relations among language families 

in which a circuit of OR of ANDs can be transformed into another equivalent circuit of AND of ORs by 
partially setting and 1 to input variables. In the unrelativized world, we can prove that BPCFL ^ CFL/n. 
This separation extends a known result of [5] that BPCFL ^ CFL. 

A Hasse diagram in Fig|l] summarizes some of the inclusion relationships among language families dis- 
cussed so far. The notation CFLH in the figure denotes the union Ufe>i(^fc^^ ^ ^k^^)- 

Although most results in this paper are embryonic, we strongly believe that these results will pave a road 
to more exciting discoveries in structural complexity theory of formal languages and automata. 

2 A Preparation for Our Expositions 

We will briefly explain basic notions and notations that help the reader go through the subsequent sections. 
Generally, we will follow the existing terminology in a field of formal languages and automata. However, the 
reader who is familiar with computational complexity theory needs extra attentions to ceratin notations (for 
instance, CFL(fc) and CFLfc) that are used in quite different ways. 

2.1 Alphabets, Strings, and Languages 

Given a finite set A, the notation || A \\ expresses the number of elements in A. Let N be the set of all natural 
numbers (i.e., nonnegative integers) and set = N — {0}. For any number n G N^, [n] denotes the integer 
set {1,2,..., n}. The term "polynomial" always means a polynomial on N with coefficients of non-negative 
integers. In particular, a linear polynomial is of the form ax + b with a,b € N. The notation A — B for two 
sets A and B indicates the difference {x \ x € A,x ^ B} and V{A) denotes the power set of A; that is, the 
collection of all subsets of A. 

An alphabet is a nonempty finite set T, and its elements are called symbols. A string x over E is a finite 
series of symbols chosen from E and its length, denoted |a;|, is the total number of symbols in x. The 
empty string A is a special string whose length is zero. Given a string x = xiX2 ■ ■ ■ x„^iXn with Xi G E, 

represents the reverse of x, defined by x^ = XnXn-i ■ • ■X2X1. We set = 1 and 1 = 0; moreover, for 
any string x = X1X2 ■ ■ ■ Xn with denotes xi X2 ■ ■ ■ Xn. To treat a pair of strings, we adopt a track 

notation [ y] of |17j . For two symbols cr and r, the notation [ ^] expresses a new symbol and, for two strings 
X = x\X2 • ■ • Xn and y = yiy2 • • • yn of length n, [ y] denotes a string [ y}] [ ^2] ' ' ' [ y"] of length n. Since this 
notation can be seen as a column vector of dimension 2, we can extend it to a k-track notation, denoted 
conveniently by [xi, X2, . ■ . , Xk]'^ , where "T" indicates a transposed vector. 

A collection of strings over E is a language over E. A set E'^, where k €N, consists only of strings of length 
k. In particular, E'^ indicates the set {A}. The Kleene closure E* of E is the infinite union IJj,gN E''. Similarly, 
the notation E-*^ is used to mean IJ^L^ Given a language A over E, its complement is E* — A, which 
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is also denoted by A as long as the underlying alphabet E is clear from the context. We use the following 
three class operations between two language families Ci and C2: Ci A C2 = {A Ci B \ A £ Ci,B G C2}, 
Ci V C2 = {AU B \ A e Ci, B € C2}, and Ci - C2 = {A - B \ A e Ci, B e C2}, where A and B must be 
defined over the same alphabet. 

As our basic computation models, we use the following types of finite-state machines: one-way deter- 
ministic finite automaton (or dfa, in short) with A-moves, one-way nondeterministic pushdown automaton 
(or npda) with A-moves, and one-way probabilistic pushdown automaton (or ppda), where a X-move (or a 
A-transition) is a transition of the machine's configurations in which a target tape head stays still. Notice 
that allowing A-moves in any computation of a one-way pushdown automaton is crucial when output tapes 
are particularly involved. 

Formally, an npda M is a tuple (Q, E, $}, F, S, qq, Zq, Qacc, Qrej), where Q is a finite set of inner states, 
E is an input alphabet, F is a stack alphabet, Zq (g F) is the bottom marker of a stack, go (G Q) is the initial 
state, Qacc (C Q) is a set of accepting states, Qrej (^ Q) is a set of rejecting states, and S is a, transition 
function mapping (Q-Qhait) x (SU{A}) xF to ViQxT*) with E = EU{((;, $} and Qhait = Qacc^Qrej- Any 
step associated with an application of transition of the form S{q, A, a) is called a X-move (or a A-transition). 
The machine M is equipped with a read-only input tape and its tape head cannot move backward. On such 
a read-only input tape, an input string is surrounded by two distinguished endmarkers and $) and, as 
soon as a tape head steps outside of these endmarkers, the machine is thought to be aborted. The machine 
must halt instantly after entering a halting state (i.e., either an accepting state or a rejecting state). More 
importantly, we may not be able to implement an internal clock inside an npda to measure its runtime. 
Therefore, we need to demand that all computation paths of M should terminate eventually; in other words, 
along any computation path, M must enter a halting state to stop. For any of the above machines M, we 
write PATHm{x) to express a collection of all computation paths produced by M on input x and we use 
ACCm{x) to denote a set of all accepting computation paths of M on input x. 

Whenever we refer to a write- only tape, we always assume that (i) initially, all cells of the tape are blank, 
(ii) a tape head starts at the so-called start cell, (iii) the tape head steps forward whenever it writes down 
any non-blank symbol, and (iv) the tape head can stay still only in a blank cell. Therefore, all cells through 
which the tape head passes during a computation must contain no blank symbols. An output (outcome or 
output string) along a computation path is a string produced on the output tape after the computation 
path is terminated. We call an output string valid (or legitimate) if it is produced along a certain accepting 
computation path. When we refer to the machine's outputs, we normally disregard any strings left on the 
output tape on a rejecting computation path. 

The notations REG, CFL, and DCFL stand for the families of all regular languages, of all context-free 
languages, and of all deterministic context-free languages, respectively. An advised language family REG/n 
in [T7] consists of languages L such that there exist an advice alphabet F, a length-preserving (total) advice 
function /i : N — F*, and a language A G REG satisfying L = {x | [ h(\x\)\ G A}, where h is length preserving 
if |/i(n)| — n for all numbers n G N. By replacing REG with CFL in REG/n. Another advised family 
CFL/rt in [20.^ is obtained from CFL. A language over a single-letter alphabet is called tally and the notation 
TALLY indicates the collection of all tally languages. 

A multi-valued partial function f is in CFLMV if there exists an npda M equipped with a one-way read- 
only input tape together with a write-only output tape such that, for every string x, f{x) is a set composed 
of all outcomes of N on the input x along accepting computation paths |23j . 

2.2 Circuit Families and Higher Complexity Classes 

For higher complexity classes, we will review basic notions and notations. To handle time/spec-bounded 
computation, we use two models of two-way deterministic Turing machine (or DTM) and two-way nonde- 
terministic Turing machine (or NTM). Each of those machines is conventionally equipped with a single 
read/write input/work tape unless otherwise stated for clarity. Let P (rcsp., NP) be composed of all 
languages recognized by DTMs (resp., NTMs) in polynomial time. Given each index A; G N, we define 
AP = AP = EP = nP = P, AP^i = P^S EP+i = NP^S and nl^, = co-E?^,, where P^ = ^AecP^ 
(resp., NP'' = IJ^ggNP"*) and P'^ (resp., NP"*) is the family of languages recognized by polynomial-time 
DTMs (resp., NTMs) with adaptive queries to a set A, which is given as an oracle. Those language families 
constitute the so-called polynomial(-time) hierarchy [TS1[TB]. Let PH — [Jf^^^ E^. We denote by L the family 
of all languages, each of which is recognized by a certain DTM with a two-way read-only input tape and a 
two-way read/write work tape using O(logn) cells of the work tape. 
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Let 1-DLIN consist of languages recognized by one-tape linear-time DTMs. For the precise definition of 
and discussion on a one-tape linear-time deterministic computation, the reader refers to |17| . 

For each fixed constant fc G N, NC'" expresses a collection of languages recognized by log-space uniform 
Boolean circuits of polynomial-size and 0(log'' n)-depth. It is known that NC° is properly included within 
NC^; however, no other separations are known to date. Similarly, AC*^ is defined except that all Boolean 
gates in a circuit may have unbounded fan-in. Moreover, SAC^ demotes a class of languages recognized 
by log-space uniform families of polynomial-size Boolean circuits of O(logn) depth and semi-bounded fan-in 
(that is, having AND gates of bounded fan- in and OR gates of unbounded fan-in), provided that the nega- 
tions appear only at the input level. This class SAC^ is located between NC^ and NC^. Venkateswaran [18] 
demonstrated that the family of languages log-space many-one reducible to context-free languages character- 
izes SAC^ . Moreover, TC^ consists of all languages recognized by log-space uniform families of 0(log ri)-depth 
polynomial-size circuits whose gates compute threshold functions. 

3 Nondeterministic Reducibilities 

A typical way of comparing the computational complexity of two formal languages is various forms of 
resource-bounded reducibility between them. Such reducibility is also regarded as a relativization of its 
underlying language family. Hereafter, we intend to introduce an appropriate notion of nondeterministic 
many- one reducibility to a theory of context-free languages using a specific computation model of one-way 
nondeterministic pushdown automata (or npda's). This new reducibility catapults a basic architecture of a 
hierarchy built over the family CFL of context-free languages in Section 21 

3.1 Many-One Reductions by Npda's 

Our exposition begins with an introduction of an appropriate form of nondeterministic many-one reducibil- 
ity whose reductions are operated by npda's. In the past literature, there were preceding ground works 
on many-one reducibilities within a framework of a theory of formal languages and automata. Based on 
deterministic/nondeterministic finite automata (or dfa's/nfa's), for instance, Reinhardt 14j discussed two 
many-one reducibilities between two languages. Tadaki, Yamakami, and Li |17| also studied the roles of var- 
ious many-one reducibilities defined by one-tape linear-time Turing machines, which turn out to be closely 
related to finite automata. Notice that those computation models have no extra memory storage to use. In 
contrast, we attempt to use npda's as a basis of our reducibility. 

An m-reduction machine is an npda equipped with an extra query tape on which the machine writes 
a string surrounded by blank cells starting at the designated start cell for the purpose of a query to a 
given oracle. We treat the query tape as an output tape, and thus the query-tape head must move to 
a next blank cell whenever it writes a non-blank symbol. Formally, an m-reduction machine is a tuple 
(Q, E, $}, Q, r, S, (Jo, Zq, Qacc, Qrej), whcrc 9 is a query alphabet and S is now of the form 

6:iQ~ Qhait) X (S U {A}) X r ^ P(g X (6 U {A}) x F*). 

There are two types of A- moves. Assuming {p, r, ^) G d{q,a,j), if a = A, then the input-tape head stays still 
(or makes a A-move); on the contrary, if r = A, then the query-tape head stays still (or makes a A-move). 

We say that a language L over alphabet E is many-one CFL-reducible to another language A over 
alphabet F if there exists an m-reduction machine M using S and F respectively as the input alphabet and 
the query alphabet such that, for every input x G E*, (1) along each computation path p £ ACCm{x), 
M produces a valid query string yp G F* on the query tape and (2) x E L if and only ii yp £ A for an 
appropriate computation path p G AC C Mix). For simplicity, we also say that M reduces (or m-reduces) L 
to A. In other words, L is many-one CFL-reducible to A if and only if there exists a multi-valued partial 
function / G CFLMV satisfying L — {x \ f{x) D A ^ 0}. With the use of this new reducibility, we make 
the notation CFL^ denotes the family of all languages L that are many-one CFL-reducible to A, where the 
language A is customarily called an oracle. Making an analogy with "oracle Turing machine" that functions 
as a mechanism of reducing languages to A, we want to use the term "oracle npda" to mean an npda that is 
equipped with an extra write-only output tape (called a query tape) besides a read-only input tape. Given 
an oracle npda M and an oracle A, the notation L{M, A) (or L{M^)) denotes the set of strings accepted by 
M relative to A. 

Let us start with a quick example of languages that are many-one CFL-reducible to languages in CFL. 
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Example 3.1 As the first example, setting E — {0, 1}, let us consider the language Duy2 = {xx \ x G S*}. 
This language is known to be non-context-free (see, e.g., [7J|TT]); however, it can be many-one CFL-reducible 
to CFL, because an m-reduction machine nondeterministically produces a query word x^\y from every input 
of the form xy using a stack appropriately, and a CFL-oracle checks whether x ~ y from the input x^[\y 
using its own stack. In other words, Dup2 belongs to CFh^^. Similarly, the non-context-free language 
Dup3 = {xxx I a; G S*} also falls into CFL^^^. For this case, we design a reduction machine to produce 
x^'^y^y^\\z from each input xyz and make an oracle check whether x ^ y ~ zhy using its stack twice. These 
examples prove that CFL^^'^ ^ CFL. A similar language Match = {xi^w \ [w — uxv] }, where ^ is a. 
separator not in x and w, also belongs to CFL' 



CFL 



Unlike polynomial-time many-one reducibility, our many-one CFL-reducibility does not, in general, satisfy 
the transitivity property, in which A £ CFL^ and B e CFL^ imply A G CFL^ for any languages A, B, C. 
To prove this fact, notice that, if CFL is closed under the reducibility, then CFL^^'" — CFL must hold; 
however, this contradicts what we have seen in Example 13.11 This non-closure property certainly marks 
a critical feature of the computational behaviors of languages in CFL. In what follows, we will slightly 
strengthen this separation between CFL^^'" and CFL even in the presence of advice. 



Proposition 3.2 CFL^^^ ^ CFL/ 



n. 



To show this separation, we will briefly review a notion of fc-conjunctive closure over CFL. Given each 
number k G N"*", the k-conjunctive closure of CFL, denoted CFL(fc) in |23| . is defined recursively as follows: 
CFL(l) = CFL and CFL(fc + 1) = CFL(fc) A CFL. These language families truly form an infinite hierarchy 
|12| . For convenience, we set CFL(a;) = IJfegN+ CFL(fc). For known advised language families, it holds that 
CFL ^ REG/n [17], co-CFL ^ CFL/n and CFL(2) ^ CFL/n [3T]. In the following proof of Proposition 



we attempt to prove that CFL(2) C CFL^*^ and CFL(2) ^ CFL/ 



n. 



Proof of Proposition [3T2l Toward a contradiction, we assume that CFL^j^'^ C CFL/n. In this proof, we 
need the inclusion relation CFL (2) C CFL^f^. As for a later reference, we intend to prove a more general 
statement below. 

Claim 1 For every index k>l, CFL(fc + 1) C CFL^^^('=\ 

Proof. Let L be any language in CFL(fc -I- 1) and take two languages Li G CFL and L2 G CFL(fc) for 
which i = Li ni2- There exists an npda Mi that recognizes Li. Without loss of generality, we assume that 
Ml enters a final state (either an accepting state or a rejecting state) when it scans the right endmarker $. 
Now, a new oracle npda N is defined to behave as follows. On input x, N starts simulating AIi on x. While 
reading each symbol from x, N also copies it down to a write-only query tape. When Mi halts in a final 
state, N enters the same inner state. It holds that, for any input x, x is in i if and only if N on the input 
X produces the query string x in an accepting state and x is actually in L2. This equivalence implies that L 
belongs to CFL^=^ , which is a subclass of CFL^'^^^'') . □ 

Since CFL(2) C CFL,^f^ by Claim [H our assumption implies that CFL(2) C CFL/n. This contradicts 
the class separation CFL(2) ^ CFL/n, proven in [21]. Therefore, the proposition holds. □ 

A Dyck language L over alphabet E = {ai, 02, ■ ■ ■ , <Jd\ U {p'l, a'2, ■ ■ ■ , cr'^} is a language generated by a 
deterministic context-free grammar whose production set is jS' X\SS\aiSal : i G [d]}, where S* is a start 
symbol. For convenience, denote by DYCK the family of all Dyck languages. 

Lemma 3.3 CFL^^^ = CFL^^''^ ^ CFL^,^^-^. 

In the following proof, we will employ a simple but useful technique of guessing and verifying a correct 
stack history (namely, a series of consecutive stack transitions). Whenever an oracle npda tries to either 
push symbols into its stack or pop a symbol from the stack, instead of actually using the stack, we write its 
stack transition down on a query tape and ask an oracle to verify that it is indeed a correct stack history. 
This technique will be frequently used in other sections. 

Proof of Lemma [SH Since CFL^,^'^^ C CFL^^'^^ C CFL^f ^ trivially holds, we are hereafter focused 
on proving that CFL^f^ C CFL^^*^^. As the first step toward this goal, we will prove the following 
characterization of CFL in terms of Dyck languages. Notice that Reinhardt jl4j proved a similar statement 
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using a language called Lpp. 
Claim 2 CFL = NFA^^"^^. 

Proof. (C) For any language L in CFL, consider an npda M that recognizes L. Let F — {cti, CT2, • • • , ^d} 
be a stack alphabet of Mi. Corresponding to each symbol Ui, we introduce another fresh symbol a[ and then 
we set F' = {crj^, CTj- • ■ • i ""dl- Without loss of generality, it is possible to assume that M makes no A-move 
until its tape head scans the right end- marker $ (see, e.g., [7] for the proof). For convenience, we further 
assume that, when the tape head reaches $, M must make a series of A-moves to empty the stack before 
entering a halting state. Let us construct a new oracle npda N . In the following description of N , we will 
intentionally identify all symbols in F with "pushed down" symbols, and all symbols in F' with "popped up" 
symbols. Given any input string x, N simulates each step of M's computation made on the input x. At 
a certain step, when M pushes a string w = cFi^^ai^ ■ ■ ■ 6 F* in place of the top symbol of the stack, TV 
first guesses this top symbol, say, Uj and then writes down a'^w on its query tape. This is because Oj is on 
the top of the stack and thus it must be first popped up before pushing w. When M pops up a symbol, N 
guesses it, say, Ui and writes down a[ (not Oi) on the query tape. Finally, B is chosen to be a Dyck language 
over the alphabet F UF'. Note that, if a query word w encodes a series of correct stack transitions (with the 
stack becoming empty when the machine halts), then w obviously belongs to B. Therefore, it holds that L 
is in CFL,^, which is a subclass of CFL^^'^'^'^. 

(D) Assume that L G NFA^ for an appropriate oracle B in DYCK. Let us take an m-reduction machine 
Ml that reduces L to B. Since B is in DCFL (C CFL), take a deterministic pushdown automaton (or a 
dpda) M2 that recognizes B. As before, we assume that M2 makes no A-move. We will simulate both AIi 
and M2 by a certain npda N in the following fashion. Given any input a;, N starts the simulation of Mi on 
X without using a stack. If Mi tries to write a symbol, say, t on a query tape, then N instead simulates one 
step of M2 's computation that corresponds to the scanning of r together with a certain symbol on the top 
of the stack. It is not difficult to show that N accepts x if and only if x is in L. Therefore, it follows that 
L e CFL. □ 

Here, we claim the following equality. 

Claim 3 CFL^ = CFL,„(NFA;^) for any oracle A. 

Proof. (C) This is rather trivial by choosing an appropriate oracle nfa. 

(D) Assume that L € CFL^ for a certain language B in NFAj^. Let us take an oracle npda Mi recognizing 
L relative to B and an oracle nfa M2 recognizing B relative to A. A new machine N is defined to behave as 
follows. On input a;, N simulates Mi on x. Whenever Mi tries to write a symbol, say, a, on a query tape, 
since M2 has no stack usage, N can simulate one or more steps (including all possible A-moves) made by 
M2 while it scans a. Finally, N produces a query word exactly as M2 does. It is possible to show that N is 
an oracle npda that recognizes L using A as an oracle. Thus, we obtain the desired membership L € CFh^. 
□ 

By combining Claims H and H it follows that CFhfJ^ = CFL„(NFA^,^'^^) C CFL^^'^'^^. □ 

We will introduce another technique of simulating two or more tape heads moving at (possibly) different 
speeds by a single tape head. Let us consider an npda M with a write-only output tape. Since the tape 
heads of M may stay still at any moments (by making A-moves) on both input and output tapes, it seems 
difficult to synchronize the moves of those two heads so that we can split the output tape into two tracks 
and produce a string [ y] from input string x and output string y of M . The best we can do is to insert 
a fresh symbol, say, [\ between input symbols as well as output symbols to adjust the speeds of two tape 
heads. For this purpose, it is useful to introduce a terminology to describe strings obtained by inserting t]. 
Assuming that t] ^ S, a \\-extension of a given string x over E is a string x over E U {\\\ satisfying that x is 
obtained directly from x simply by removing all occurrences of 1] in x. For instance, ii x = 01101, then x 
may be OllllllOl or OlltjtjOlt]. 

Naturally, we can extend Dyck languages by adding a special symbol t] as a part of its underlying alphabet 
and considering d-tuples of strings over this extended alphabet. More formally, for each index d G N~^, 
DYCK^^* consists of all languages L such that there exist d extended Dyck languages Ai, A2, . . . , Ad for 
which L consists of elements of the form [xi,X2, ■ . . , Xd]'^ satisfying the following: for every index i € [d], Xi 
belongs to At. In particular, when d = 2, any language L in DYCK2^* has the form {[y] \ x € A,y G B} 
for certain extended Dyck languages A and B. It is worth noting that DYCK^^*^ is a subclass of DCFL(d) 
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(= A.^mDCFL). 

The following corollary generalizes Claim [31 



Corollary 3-4 For each fixed index d e N+, CFL(d) = NFA^'^"^^'' . 

In the proof of Corollary 13.41 to simplify the description of simulations of given oracle npda's, we need 
to introduce a special terminology. Let M be any oracle npda and A be any oracle. We say that a string 
w of the form [ |] encodes input x and query word y along a computation path of M if (i) along a certain 
accepting computation path p of M on x, M starts with the input x and produces a string y on its query 
tape, (ii) x and y are [^-extensions of x and y, respectively, and (iii) there is another oracle npda N that 
takes w and, by scanning each symbol in w by a single tape head, it can simulate the computation path p as 
follows. When scanning a symbol of the form [ !^], if cr ^ tj, then N simulates one step of M while scanning 
a on its input tape; on the contrary, if cr = 1], then N simulates one A- move of M without reading any input 
symbol. At the same time, if r ^ [\, then N simulates one step of M while writing r on its query tape; 
otherwise, N does nothing. Similarly, we define the concept of "[|] encodes stack history y and query word 
z along a computation path." 



Proof of Corollary 13.41 Let L be any language defined as L = Hiefd] ^« ^ languages Ai, A2, . . . , Ak 
in CEL. For each index i e [d], assume that an npda Mi recognizes A.^. Consider the machine N that, on 
input X, simulates several steps of Mi, M2, . . . , in parallel while they scan each input symbol. During 
this simulation, N writes a stack history of Mi onto the ith track of its query tape. However, to adjust the 
speeds of d tape heads, we appropriately insert the symbol t]. If a query word w correctly represent d stack 
histories of d machines, then w must be in DYCK^^* . □ 

For later use, we will generalize an argument used in the proof of Claimj^l We say that a language family 
C is \\-extendihle if, for every language A in C, two special languages A\ — {[|] | z G A} and A\ — {[f] I 2/ 6 ^} 
also belong to C, where y and z are any l]-extensions of y and z, respectively. 

Lemma 3.5 Let C be any nonempty language family. If C is \\- extendible, then CFLf^ C NFAJ^'"'^'^^'' holds. 

Proof. Take any oracle A in C and consider any language L in CFL^^. Moreover, let M be any m-reduction 
machine that reduces L to A. With a similar construction as in the proof of Corollarv 13.41 we will construct 
an oracle npda N that behaves as follows. On input x, N simulates M on x and produces on its own query 
tape strings of the form [ that encode stack history y and query word z along a computation path of M . 
Choose a Dyck language D that correctly represents any stack histories of M and then define B as the set 
{[ f] I y G e A]. It is clear by its definition that B belongs to DCFL AC. □ 

Hereafter, we will explore the properties of C¥1j'^^'^^\ First, we will see a few examples. 

Example 3.6 The language Sq = {0"1"' | n > 1} belongs to CFL^f^^^^ To see this fact, let us consider 
the following oracle npda N and oracle A. Given any input w, N first checks if w is of the form O'l^. 
Simultaneously, N nondeterministically selects (ji , j2 , ■ • ■ , jk ) satisfying j = ji+j2 + - ■ ■+jk, and it produces 
on its query tape a string w' of the form 0*^1-'^ t]!-'^ I] ■ ■ ■\\V''. An oracle A receives w' and checks if the 
following three conditions are all met: (i) ji = j2, = ji, • ■ •, (ii) 32 = js, ji = js, • ■ •, and (iii) 
i = fc by first pushing 0* into a stack and then counting the number of I]. It is rather easy to show 
that A belongs to CFL(3). Therefore, Sq is in CFL;^, which is included in CFL^^^^^^. A similar idea 
proves that the language Comp = {0" | n is a composite number} belongs to CFLJ^^'"*-^-'. In symmetry. 
Prim = {0" I n is a prime number} is a member of co-(CFL^^'^'-^-'). 



The lack of the transitivity property of the many-one CFL-reducibility necessitates an introduction of 
a helpful abbreviation of a k-fold application of the reductions. For any given oracle A, we recursively set 
= CFL,„ and C¥lj„^^f..+i] = CFL,„(CFL„[j,]) 

Wn[k] denotes the union Uagc C^^m[fc]- 



CFL;^[i] = CFL^ and CYh^^y^+i] ~ CFL,„(CFL^[j,]) for each index fc G N+. Given each language family 
C, the notation CFL^r^i denotes the union Uagc ^-^-'^mffcl- ^ close relationship between CFL(fc)'s and 



CFL^I^'s is exemplified below. 



Theorem 3.7 For every index fc e N+, CFL,^''^^'^) = CFL^^^^. 

As an immediate consequence of Theorem 13.71 the union UfceN+ ^-'^-'^mffc] tias a succinct expression of 
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Corollary 3.8 CFL^^i^^") = UfceN+ CFL^fl. 

Proof. Since CFL^f^^ = CFL^^'^C^) by Theorem O it holds that Ufc6N+ CFL^f^^ = UfceN+ CFL^f^^'^^ 
Thus, it suffices to show that UfceN+ CFL^^^^^' = CFL,^''^^'^) . Since CFL(fc) C CFL(u;), we obtain 
UfceN+ CFL^/^f*^) C UfeeN+ CFL,^/^("). The last term coincides with CFL^''^^'^) since CFL^''^^'^) is in- 
dependent of the value k. Conversely, let L be any language in an appropriate or- 
acle B € CFL(w) for which L G CFL^. Since CFL(a;) = UfeeN+ CFL(fc), B must be in CFL(/c) for 
an appropriate index fc G N+. This implies that L G CFL^ C CFL,^^^^''^ Therefore, it holds that 
CFL,^''^^'^) C |JfceN+ CFL'^^'-'^I This completes the proof of the corollary. □ 

Now, we are focused on the proof of Theorem O When fc = 1, it holds that CFL^^f^j = CFL,^f^(^' = 
CFL^^^. The proof of Proposition 13 . 71 for fc > 2 is made up of two lemmas. Lemmas 13.91 and 13.101 

Lemma 3.9 For every index k>2, CFL^^^^ C CFL^f^e^'^ holds. 

Proof. Let us consider the case where fc = 2. First, we claim the following inclusion relationship. 

Claim 4 For every index r G N+, CFL^^^V'^' - CFL,^f 

Proof. For a certain language A G CFL(r), let us assume that L G CFL^^]- Furthermore, choose 
an appropriate set B and let two m-reduction machines Mi and AI2 respectively witness the membership 
relations L G CFL^ and B G CFL^. We will define a new oracle npda N in part using a stack- history 
technique shown in the proof of Claim [51 Let F be a stack alphabet of Al2- Corresponding to F, we prepare 
an associated alphabet F' = {cr' | cr G F} and set FUF' to be a new stack alphabet. On input x, N simulates 
Ml on X in the following way. Whenever Mi tries to write a symbol, say, 6 on a query tape, N instead 
simulates, without using any actual stack, several steps (including A-moves) of M2 that can be made during 
reading b. When M2 tries to push down a string s by substituting a top symbol of its stack, N guesses this 
top symbol, say, a and then produces a's on the upper track of its query tape. When AI2 pops a symbol, N 
guesses this popped symbol, say, a and writes a' on the upper track of the query tape. At the same time 
during the simulation, N produces M2's query word on the lower track of the query tape. To fill the idling 
time of tape heads, we need to insert an appropriate number of symbols t] so that [ |] encodes stack history 
y and query word z along a computation path of M2 . Finally, we define C as a collection of strings of the 
form [ ^ such that y (resp., z) is a [^-extension of a correct stack history y (resp., a valid query string z in 
A). Since the above definition requires the correctness of y and z, C belongs to CFL(r) A CFL. Moreover, 
for every string x, x is in L if and only if there exists an accepting computation path in ACC^ix), along 
which N produces [|] in C. This relation implies that C is in CFL^. □ 

For the case of fc > 3, it holds that CFL^^^'^^] = CFL™(CFL^'['^!li]) C CFL,„(CFL^''^(''-i)), where the last 
inclusion comes from the induction hypothesis. Note that CFL^f^^ C CFL,„(CFL^^^('="i)) = CFL^pV""^^- 
Claim 12 then implies that CFL,%^ C CFL^^^C^-i^^^^^ ^ CFL^'^lC^). □ 

Lemma 3.10 For every index k>l, CFL,^^^^'') C CFL,^^^^^. 

Proof. By induction on fc > 1, we will prove the lemma. Since the lemma is trivially true for 
fc = 1, let us assume that fc > 2. Since CFL(fc) C CFL^^^^''"^^ by Claim [TJ it instantly follows that 
CFL^^^e^) C CFL„(CFL^^^('=~i)). Moreover, because CFL^^^^'^-^) C CFL^f^^_i] by our induction hy- 
pothesis, we conclude that CFL,^^^^'') C CFL„,(CFL^^'[;^lij) = CFL^I^. □ 



Toward the end of this section, we will make a brief discussion on a relationship between two language 
famihes CFL/n and CFL^"^^^^. Given a language A over alphabet S, the notation dense{A)(n) indicates 
\\A n S"!! for every length n G N. Let DENSE{f{n)) be the collection of all languages A such that 
dense{A){n) < f{n) holds for aU lengths n G N. Note that TALLY C DENSE{0{1)) C SPARSE, where 
SPARSE = DENSE{n'='(^y). 
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Proposition 3.11 1. CFL/n C CFL^^^^-^^'^^^" . 
2. CFL^ALi^Y ^ CFL^FL(2)/„. 

Proof. (1) This is rather obvious by taking an advice function h and define A — {h{n) \ n e N}, which is 
in DENSE{1) by the definition. 

(2) Let L e CFL;^ for a certain A in TALLY. Let M be a reduction machine that reduces L to A. 
Without loss of generahty, we assume that A C {1}*. For simphcity, we also impose a restriction that \ ^ A. 
Next, we will define A'^i as follows. Let p{n) = an be a linear polynomial that bounds the running time of M 
on inputs of length n > 1. We define our advice function h as — hih2 ■ ■ ■ hn for any number n G N"*", 
where each symbol h^ equals x^(l''~^^°+^) . • • , X'^(l"')]^- Note that \h{n)\ = n. We then 

define a language B to satisfy L = {x | [ h(\x\)] G B}. On input [ ^] with \s\ = \x\, Ni simulates M on x and 
generates a query string [V] if M makes a query y, where y and s are [^-extensions of y and s, respectively. 
In this process, if y ^ then iVi immediately enters a rejecting state. Another oracle npda N2 works 

as follows. On input of the form [ |], using a stack appropriately, N2 eliminates t] and produces y^s on its 
query tape, where # is a fresh symbol. The third machine iVa, taking y^s as input, finds the ith block 
hi — [61,62, • • ■ ,ba]'^ of s, where i — \\y\/a], and reads a symbol bj, where j — \y\ — {i — l)a. If bj = 1, 
then A3 enters an accepting state and, otherwise, it enters a rejecting state. This whole process puts B to 
CFL,„(CFL^f ^), which is CFL^,^2^. By Theorem Em it follows that B is in CFL^^'^^K It is not difficult to 

show that, for any string x, x e L if and only if [ h(\x\)] £ B. Therefore, L belongs to CFL^^^^'^^n. □ 



3.2 Other Powerful Reducibilities by Npda's 

In the previous sections, our primary interest has rested on the many-one CFL-reducibility. It is also possible 
to introduce two more powerful reducibilities, known as truth-table reducibility and Turing reducibility, into 
a theory of context-free languages. 

We define a notion of Turing CFL-reducibility using a model of npda with a write-only query tape and 
three extra inner states qquery, Qno, and qyes that represent a query signal and two possible oracle answers, 
respectively. More specifically, when an oracle npda enters qquery, it triggers a query, by which a query 
word is automatically transferred to an oracle. When an oracle returns its answer, (no) or 1 (yes), the 
oracle automatically sets the oracle npda's inner state to g„o or qy^s , respectively. Such a machine is called a 
T-reduction machine (or just an oracle npda as before) and is used to reduce a language to another language. 
Unlike many-one CFL-reductions, an oracle npda's computation depends on a series of oracle answers. Since 
an oracle npda, in general, cannot implement any internal clock to control its running time, we should 
demand that, no matter what oracle A is provided, its underlying oracle npda M must halt eventually on 
all computation paths. 

Lemma 3.12 For any oracle A, CYh^ C CFL^ = CFL^. 

Proof. The first inclusion is obvious because the Turing CFL-reducibility can naturally simulate the 
many-one CFL-reducibility by making a single query at the very end of its computation and deciding to 
either accept or reject an input based on an oracle answer. To see the last equality, let L be any language 
in CFLy, witnessed by a T-reduction machine M. Now, we want to prove that L G CFL;^ via another 
T-reduction machine A^. This machine A^ is defined to behave as follows. Given any input x, N simulates 
M on X and, whenever M receives an oracle answer, say, b e {0, 1}, A^ treats it as if b and continues the 
simulation of M . This definition clearly implies that A^ accepts x relative to A if and only if M accepts 
X relative to A. Thus, L is in CFL;^. We therefore conclude that CFL;^ C CFL;^. By symmetry, we also 
obtain CFL^ C CFL^, implying that CFL^ = CFL^. . □ 

As a simple relationship between the Turing and many-one CFL-reducibilities is exemplified in Propo- 
sition 13.131 To describe the proposition, we need a notion of the Boolean hierarchy over CFL, which was 
introduced in by setting CFLi = CFL, CFLzfc = CFLafe-i A co-CFL, and CFLsfe+i = CFLzfc V CFL. 
For simplicity, we denote by BHCFL the union UfcGN+ CFL^. Notice that CFL 7^ CFL2 holds because 
co-CFL C CFL2 and co-CFL ^ CFL. 

Proposition 3.13 CFL?^^ = CFL^^^^ = NFA^^^^ 
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For the proof of this proposition, the foUowing notation is required. If M is an (oracle) npda, then M 
denotes an (oracle) npda obtained from M simply by exchanging between accepting states and rejecting 
states. 

Proof of Proposition [37131 In this proof, we will demonstrate that (1) CFL^^^ C CFL^f^\ (2) 
CFL,^^^" C NFA^f"^'" , and (3) NFA^'"^" C CFL^'"^. If ah are proven, then the proposition immediately 
follows. 

(1) We start with an arbitrary language L in CFL^ relative to a certain language A in CFL. Take a 
T-reduction machine M reducing L to A, and let Ma be an npda recognizing A. Hereafter, we will build 
three new m-reduction machines iVi, A^2,-^3 to show that L e CFL^^'"^. On input x, the first machine A'^i 
tries to simulate M on x by running the following procedure. Along each computation path, before M begins 
producing the zth query word on a query tape, Ni guesses its oracle answer bi (either or 1) and writes it 
down onto its query tape. While M writes the zth query word y^, iVi does the same but appends yi\\ to bi. 
When M halts, iVi produces a query word w of the form biyi\]b2y2^ • ■ • ^buyk^, where fc G N. 

The second machine works as follows. On input w of the above form, N2 does the following procedure. 
On reading 6,;, If 6^ = 1, then N2 simulates Ma on yi. If bi — 0, then N2 skips y^. Whenever Ma enters 
a rejecting state, A''2 also enters a rejecting state and halts. The third machine takes input w and, if 
&i = 0, then N'i simulates Ma on yi\ otherwise, skips yi. It is not difficult to verify that A^i m-reduces 
L to L{N2)aT{Nz). This leads to a conclusion that L is included in CFL„(CFL A co-CFL) = CFL^''^^ 

(2) Note that CFL2 is ^-extendible. Proposition [S3] implies that CFL,^^^^ C NFA°°^^^°^^=^ . Since 
DCFL C co-CFL, it follows that DCFL A CFL2 C co-CFL A (CFL A co-CFL) = (co-CFL A co-CFL) A CFL. 
The last term clearly equals co-CFL A CFL = CFL2, and thus we conclude that CFL^^^" C NFA^^^^ 

(3) Choose an oracle A in CFL2 and consider any language L in CFL;^. Furthermore, take two languages 
Ai,A2 G CFL for which A = Ai H A2. Let M be an oracle nfa that recognizes L relative to A. Notice that 
M has no stack. We will define an oracle npda N as follows. On input x, N first marks on its query tape 
and start simulating M on x. Whenever M tries to write a symbol a on its query tape, writes it down on 
a query tape and simultaneously copies it into a stack. After M halts with a query word, say, w, N makes 
the first query with the query word Ow. If its oracle answer is 0, then N rejects the input. Subsequently, 
N writes 1 on the query tape (provided that the tape automatically becomes blank), pops the stored string 

from the stack, and copies it to the query tape. After making the second query with Iw^, if its oracle 
answer equals 1, then N rejects the input. When N has not entered any rejecting state, then N finally 
accepts the input. The corresponding oracle B is defined as {Ow \ w S Ai} U {Iw^ \ w G A2}. It is easy to 
see that x £ L if and only if N accepts x relative to B. Since CFL is closed under reversal (see, e.g., [7]), 
{Iw^ \ w £ A2} is context-free, and thus B is in CFL. we then conclude that L e CFLf C CFL^^^. □ 

As another typical reducibility, we are focused on nondeterministic truth-table reducibility. Notice that 
an introduction of nondeterministic truth-table reducibility to context-free languages does not seem to be as 
obvious as that of nondeterministic Turing reducibility. Ladner, Lynch, and Selman llOj first offered such a 
notion for NP. Another definition, which is apparently weaker than that of Ladner et al., was proposed by 
Book, Long, and Selman [5] as well as Book and Ko [5]. The following definition follows a spirit of Ladner 
et al. [To] with a slight twist for its evaluator. 

Letting k G N"*", a language L is in CFL^^^ if there are a language B G REG and an npda N with k 
write-only output tapes besides a read-only input tape such that, for any input string x, (1) AC C Mix) 7^ 0, 
(2) along every computation path p G ACCn{x), N produces a string yp^ G F* on the ith write-only 
query tape for each index i G [k], (3) from a vector {y^\y^\ . ■ . , y'ff') of query words, we generate a fc-bit 
string Zp — xtiv-p^ tV-p^ ^ ■ ■ ■ ' v'p'') ^ {0; Ij'^i and (4) a; is in L if and only if [ i^] is in B for an appropriate 
computation path p G ACCn{x). Here, the set B is called a truth table for A. For convenience sake, we 
often treat i? as a function defined as B{x,y) = 1 if [y] G i? and B{x,y) = otherwise. In the end, we set 
CFL^,, to be the union \J^^^,^CFhtu. 

It is clear that CFL^UCFL^ C CFhf^^. By flipping the outcome (accepting or rejecting) of a computation 
generated by each dfa that computes a truth-table, we obtain CFL^jj C CFL^j^. In symmetry, CFL^jj C 
CFL^jj also holds. Therefore, the statement given below holds. 

Lemma 3.14 For every language A and index k >1, GFL^ U CFL;^ C CFL^j^ = CFL^^j. 

Unlike NP, we do not know whether CFLJ^jj C CFL^ holds. This is mainly because of a restriction 
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of npda's memory use. It may be counterintuitive that Turing reducibiUty cannot be powerful enough to 
simulate truth-table reducibility. On the contrary, the following inclusion holds in a case of a constant 

number of queries. 

Lemma 3.15 Let k G be any constant. For every language A, CFL^^ C CFL^^j holds. 

Proof. Let M be any T- reduction machine that witnesses the membership L G CFLj5j-. To show that 
L G CFL^jj, we will build another 6tt-reduction machine N that is equipped with k query tapes. This 
oracle npda A'' works as follows. On input x, it simulates M on the input x. When M tries to write the 
ith query word, say, yi, N produces y-i on its ith query tape. The machine N then guesses an oracle answer 
fli G {0, 1} for yi (without any actual query) and remembers the value within the machine's finite control 
imit. After M halts in an accepting state, N makes k queries to A. Note that accepts x if and only 
if aia2 • • ■ Ofe = Xfc (2/17 1/2, • • • ,2/fc) along a certain accepting computation path. Thus, N btt-iednces L to A 
with only k queries. □ 



Theorem 3.16 CFL^""^^^ = CFLg^^ = NFAg^^. 

Before proving this theorem, we will present a characterization of BHCFL in terms of bounded-truth-table 
reductions. For this purpose, we need to introduce a new relativization of DFA. A language L is in DFA^j^ 
if there exists an oracle dfa that produces k query words yi,y2, ■ ■ ■ ,yk such that, for every string x, x G 
if and only if B{x, x^(2/i, ^2, • • • , Vk)) = 1- We set DFA^,, = \J^^^+ DFA^,, and DFA^,, = U^^c DFA^,, for 
any language family C. 

Lemma 3.17 BHCFL = DFA^^^^. 

Proof. We split the lemma into the following two separate claims: (1) BHCFL C DFA^j^'" and (2) 
DFA^j^^ C BHCFL. 

(1) Our goal is to prove by induction on G N+ that CFL^ C DFA^j^^. From this assertion, it follows 



that BHCFL = U,eN+ CFL^ C [j^^^^ DFA^/,^ = DFA^,^^ 

As for the base case fc = 1, it is clear that CFL C DFA^f^'" C DFA^^f". Now, let us concentrate on 
the induction step k > 2. Meanwhile, we assume that k is even. Since CFL;; = CFL;j_i A co-CFL, take 
two languages Li G CFLfc_i and L2 G co-CFL and assume that L = Lif\ L2. Assume also that an npda 
M2 computes L2. Since Li G DFA^^^j^-j^j by our induction hypothesis, there is an oracle dfa, say. Mi that 
recognizes Li relative to oracle A in CFL. On input x, Mi produces {k — l)-tuple (2/1,2/2, • • • ,2/fe-i) o^i its 
query tape and it satisfies that B{x, Xfe_i (2/1, 2/2, • • • , Vk-i)) = 1 if and only if x is in Li. We want to define 
a new machine N as follows. By simulating Mi, TV generates fc — 1 query words (yi,i/2, • • • ,2/fe-i) as well 
as a new query word t]a;, where t] is a fresh symbol. Moreover, we define A' = AU {[\x \ M2 accepts x}, 
which is obviously in CFL. Now, we define B' as {(a;, 6162 • ■ • 6fe-i6fc) | (a;, 6162 • • • &/c-i) & B Abk = 0}- 
Clearly, B' is regular since so is B. It also holds that B'{x,Xk (j/i, 2/2, • • • , 2/fc-i, tia^)) = 1 if and only if 
Bix,Xk-i{yi,y2, ■ • ■ ,2/fe-i)) = 1 and a; G I^. Therefore, L belongs to CFL^'^ C CFL^^^^. 

The case of odd fc is proven by slightly modifying the above proof 

(2) We will prove that, for any index fc G N+, DFA^^'j^ C CFLfez^+i- We begin with the case where 
fc = 1. Assume that L G DFA^^j for a certain language A in CFL. Let Mi be an oracle npda that recognizes 
L relative to A and let B be a truth-table used for Mi. Moreover, let M2 be an npda that recognizes A. 
Without loss of generality, we assume that M2 makes no A-move. A new machine Ni works in the following 
way. Given any input x, Ni first checks if B{x,l) = 1. Simultaneously, A^i simulates Mi on x. When Mi 
tries to write a symbol, say, 6, on the tth query tape, A^o simulates one step of M2's computation during 
the scanning of h. Similarly, we define A^o except that (i) it checks if B{x, 0) = 1 and (ii) if M2 enters an 
accepting state (resp., a rejecting state), then A^o enters a rejecting state (resp., an accepting state). It is 
important to note that this machine Nq is co-nondeterministic, and thus L{No) is in co-CFL, whereas L{Ni) 
belongs to CFL. It also follows that L = L{No) U L{Ni). Hence, L belongs to CFL V co-CFL, which is 
included in CFL3 C CFL4, as requested. 

For the case fc > 2, we need to generalize the above argument. Assume that L G DFA^j^ for a certain 
language A in CFL. Let Mi be a fctt-reduction machine that reduces L to A and let M2 be an npda 
recognizing A. Here, we assume that M2 makes no A-move. In the following argument, we fix a string 
b = 6162 •■•6fe e {0,1}'=. Letting CFL^°^ = C0-CFL2 and CFL^^^ = CFL2, we define CFL^*''"'"'"=^ as an 
abbreviation of CFL^*"^^ V CFL^*"^^ V ■ • ■ V CFL^^^K 
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Now, we will introduce two types of machines -/V^.o and -/V^.i. The machine N^^i takes input x and checks if 
-B(x, b) = 1. At the same time, Nh,i guesses a number i € [k] and simulates Mi on a; if 6^ = 1 (and, otherwise, 
it enters an accepting state instantly). Whenever Mi tries to write a symbol, say, a on its own query tape, 
Nb^i simulates one step of M2's computation corresponding to the scanning of a. As for the other machine 
Nb^Q, it simulates Mi on a; if 5i = (and accepts instantly otherwise). Note that Nt^o is a co-nondeterministic 
machine. For each index e S {0,1}, let A},^e be composed of strings accepted by Nt,^e and we set At, — 
Abfi U Ah^i, which obviously belongs to CFL2. It is not difficult to show that L = Ube{o i}*" ^b', thus, L is in 

Vhe{o.i}'= C^FL^''-', which equals Vbefo.i}'' ['^Vi:hi=i CFL2) V (Vi:hi=o C0-CFL2)]. By a simple calculation, the 

last term coincides with (Vfc=i ' CFL2)V(Vfe=i ' C0-CFL2). Since C0-CFL2 = CFLVco-CFL C CFLVCFL2, 

it follows that V-=i"' C0-CFL2 C V-'r'(CFL V CFL2) = (V-fr' CFL) V (V-fr' CFL2) = CFL2. 

Therefore, L belongs to Vi=i CFL2. Note that CFLfc2fc+i = Va:=i CFL2 by Claim [SI We thus conclude that 
L is in CFLj,2fc+i • n 

Finally, we will present the proof of Theorem l3.16l for this proof, we need to introduce the third simulation 
technique of encoding a computation path of an npda into a string. Notice that a series of nondeterministic 
choices made by an npda M uniquely specifies which computation path the machine has followed. We encode 
such a series into a single string. Let S he a transition function of M. First, we rewrite S in the following 
way. If 6 has an entry of the from S{q, cr, r, rj) — {(pi, Ci; Ci)i (P2, C2j C2)}, then we split it into two transitions: 
S{q,a,T,'r]) — (pi,^i,Ci) and S{q,a,T,r]) = {p2,£,2,C2)- Let D be the collection of all such new transitions. 
Next, we index all such new transitions using numbers in the integer interval [Hi^ll] = {1,2,...,||Z)||}. 
A series of transitions can be expressed as a series of those indices, which is regarded as a string over the 
alphabet S = [|| Z? ||]. We call such a string an encoding of a computation path of AI . 



Proof of Theorem [mil In this proof, we wih prove three inclusions: CFL^^^^ C NFA^^f^ C CFL„(DFA^J^) C 

CFI 

btt 



CFL^j^'". Obviously, these inclusions together ensure the desired equations given in the theorem 



CMm 5 CFL^FLcNFA,c,f . 

Proof. Fix k G and let L be any language in CFL^jj for a certain oracle A £ CFL. For this L, there 
is an oracle npda, say. Mi that recognizes L using A as an oracle. We want to define another machine N by 
modifying Mi in a similar way presented in the proof of Claim [21 On input x, N simulates Mi on x using 
fc + 1 query tapes as follows. When Mi tries to push aia2 - ■ ■ o-n in place of a certain symbol on the top of a 
stack, N guesses this symbol, say, a and then writes down a'ai • • • cr,i on the k + 1st query tape. If Mi pops 
a certain symbol, then N first guesses this symbol, say, tr and writes down cr' on the k + 1st query tape. 
Finally, we define B' as the set {[a;, yi, ?/2, ■ • • ,2/fc, 1]^ | B{x,yiy2 ■ ■ -yk) = 1}- Associated with A^'s k + 1st 
query, we choose an appropriate language C in DYCK . Define A' = AVJC , assuming that C is based on a 
different alphabet. Note that a: is in L if and only if B' {x ^ Xk+ii^)) = 1- Hence, L belongs to NFA^^^^jtti 
which is a subclass of NFA^j^^. □ 

Claim 6 NFA^tf^ C CFL,„(DFA^tf^). 

Proof. Let fc > 1. Assume that L G NFA^^ with a certain oracle A e CFL. Let M be a fctt- reduction 
machine that reduces L to A. To show that L e CFLm(DFAfjf '^), we define an oracle npda A^i as follows. 
Given any input a;, A^i simulates M by writing a string that encodes a computation path y of M . At any 
time when M tries to write any symbol on its own query tapes, A'^i simply ignores the symbol and continues 
the above simulation. Next, we define A''2 as follows. On input of the form [|], A^2 deterministically simulates 
M on X by following a series of nondeterministic choices specified by y, and N2 produces k query strings as 
M does. Note that A^i m-reduces L to L{N2,A) and that ^(^^2,^) is in DFA^j. Therefore, L belongs to 
CFLfJ^^'^' C CFL,„(DFA^J^). □ 

Claim 7 CFL™(DFA^J^) C CFL^^^^. 

Proof. Take any oracle A e DFA^j^'" and assume that L G CFL^. Let Mi be an m-reduction machine 
reducing L to A. Fixing fc G N"*", we assume that M2 is a fctt-reduction machine that reduces ^ to a 
certain language B in CFL. Now, we define N as follows. On input x, N simulates Mi on a:. When Mi 
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writes a symbol, say, b, N simulates one or more steps (including A- moves) of Af2's computation during the 
scanning of b. Finally, N outputs M2's k query strings. It holds that N fctt-reduces L to B, and thus L is 
in CFLf,, C CFLgf^ □ 

We have just proven that CFL^j^^ = NFA^^^^ = CFL,„(DFA^t^^). By Lemma [XT71 the last term equals 
CFL^"^^^. This completes the proof. □ 



3.3 Languages That are Low for CFL 

We will briefly discuss a notion of lowness, which concerns oracles that contain little information to help 
underlying oracle machines improve their recognition power. More precisely, consider a language family C 
that is many-one relativizable. A language A is called many-one low for C if C C holds. We define low,„C 
to be the set of all languages that are low for C; that is, low,„C — {A \ C C}. Similarly, we define lowf,ttC 
and low^C as a collection of all languages that are "btt low for C" and "Turing low for C," respectively. 

Languages in REG, when playing as oracles, have no power to increase the computational complexity of 
relativized CFL. 

Lemma 3.18 CFL = CFLJ^'^^ = CFL^^f^ ^ CFL^'^'^. 

Proof. Since CFL C CFL^' and E* G REG for S = {0, 1}, it follows that CFL C CFLJ^'^'^. Moreover, 
by Lemmas [XT^ and [XTil it holds that CVh^f^ C CFLf^f^ and CFL^f ° C CFL?'^^. To show that 
CFLj^jf*^ C CFL, take any language L in CFL^jj for a certain index k e N+ and a certain language 
A e REG. Let M be an oracle npda equipped with two write-only query tapes that fctt-reduces L to A. In 
addition, let N denote a dfa recognizing A. We aim at proving that L e CFL. Let us consider the following 
algorithm. We start simulating M on each input without using any write-only tapes. When M tries to write 
down a fc-tuple of symbols (si, S2, . . . , s^), we instead simulate N using only inner states. Note that we do 
not need to keep on the query tapes any information on (si, S2, ■ • ■ , Sfc). Along each computation path, we 
accept the input if both M and N enter accepting states. Since the above algorithm requires no query tapes, 
it can be implemented by a certain npda. Moreover, since the algorithm correctly recognizes L, we conclude 
that L is in CFL. 

Based on a similar idea, we can prove that CFL^^*^ C CFL. □ 



Lemma 3.19 1. REG C IowtCFL n lowbtfCFL C IowtCFL U lowbttCFL C low^CFL C CFL. 
2. lowbttCFL C CFL n co-CFL. 

Proof. (1) From Lemma IXTSl it holds that CFL?'^^ = CFLj^^f^ = CFL. Thus, it follows that REG C 
IowtCFL nlowbtt CFL. The third inclusion comes from the fact that CFL^ C CFL^j nCFL^ for any oracle 
A. The last inclusion is shown as follows. Take any language A in IowtCFL. This means that CFL;^ C CFL. 
Since A belongs to CFL;^, it holds that A e CFL;^ C CFL. Next, we wish to show that CFL ^ IowtCFL. If 
CFL = IowtCFL holds, we obtain CFL^"^^ C CFL. Since CFL(2) C CFL^^^ by LemmafU we immediately 
conclude CFL(2) = CFL. This is indeed a contradiction against the well-known result that CFL(2) ^ CFL. 
Thus, it must hold that IowtCFL ^ CFL. 

(2) For this inclusion, let us consider any language A in lowhjtCFL; that is, CFLJ^^ C CFL. Since 
A, A G CFL;;^t, it follows that A, A G CFL. Thus, A must belong to CFL n co-CFL. 

For the last proper inclusion, we assume that lowbttCFL = CFLHco-CFL. Thus, CFL^j^^'^™"^'"^ = CFL. 
This implies CFL°^^^ C CFL, because CFL^ C CFhf^^ and DCFL C CFL n co-CFL. However, this is a 
contradiction because CFLJ^'^^'" ^ CFL/n by Proposition 13.21 and Lemma [331 □ 

It is not known whether all inclusion relations in the lemma are actually proper. 



4 The CFL Hierarchy 

Nondeterministic polynomial-time Turing reductions have been used to build the polynomial hierarchy, each 
level of which is induced from its lower level by applying the reductions. With use of our Turing CFL- 
reducibility defined in Section 13.21 a similar construction applied to CFL introduces a unique hierarchy. 
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which we call the CFL hierarchy. We will explore fundamental properties of this new hierarchy throughout 
this section. 



4.1 Turing CFL-Reducibility and a Hierarchy 

Applying Turing CFL-reductions to CFL level by level, we can build a useful hierarchy, called the CFL 
hierarchy ^ whose ki\\ level consists of three language families A^^'", Sj?^'", and H^?^'^. To be more precise, for 
each level A: > 1, we set A^^^ ^ ^cfl ^ j^cfl ^ j^cfi. ^ ^^FL, A^fl = DCFLT(sr^), n^FL = co-S^fl^ 

and Y.'^ll = CFLt(S^''^). Additionally, we set CFLH = Ufc6N+ ^fe^^- 

The CFL hierarchy can be used to categorize the complexity of typical non-context-free languages dis- 
cussed in most introductory textbooks, e.g., [71 [TT]. We will review typical examples that fall into the CFL 
hierarchy. 

Example 4.1 We have seen in Example 13.11 the languages Dup2 — {xx | x G {0, 1}*} and Dup[i— {xxx \ 
X G {0, 1}}, which are both in CFL^^^. Note that, since CFL^ C CFL^ for any oracle A, every language 
in CFlj^f^ belongs to CFL^^^ = E^fl^ Therefore, Dup2 and Dupg are in E^fl^ addition, as shown in 
Example 1 3. 6 [ the language Sq = {0"1" | n > 1} is in CFL^^'"*-^'' while Prim = {0" | n is a prime number } 
is in co-(CFL^^^(^)). Owing to a proposition (Proposition|iHl) proven later, it holds that CFL^f ^^^^ C E^fl^ 
Therefore, we conclude that Sq is in EgF'" and Prim is in IlgF'". A similar but more involved example is 
the language MulPrim — {0"™ | m and n are prime numbers }. Consider the following three npda's. 
The first machine Mi nondeterministically partitions a given input 0*"' into (j/i, 2/2, ■ • ■ , ?/«) and produces 
— yi^y2^ • • • tl?/n on a query tape by inserting a new symbol t]. During this process. Mi pushes u = yi^l" to 
a stack, where ^ is a fresh symbol. In the end. Mi appends #m to w on the query tape. In receiving w^u as an 
input, the second machine M2 checks if y2i-i = y2i for i = 1, 2, . . .. At any moment when the checking process 
fails, M2 enters a rejecting state and halts. Next, M2 nondeterministically partitions yi into (zi, Z2, . . . , Ze) 
and 1" into {xi, X2, ■ . ■ , Xd) and then it produces w^u'^v' on the query tape, where u' = zi\\Z2\\ ■ ■ ■ \Zf, and 
v' = Xi\x2\\ • ■ ■ tjXd- In receiving w^t^u' the third machine A/3 checks if j/2i+i = ?/2i+i for i — 1,2,.... 
Whenever this process fails, M3 instantly halts in a rejecting state. Next, M3 nondeterministically chooses 
a bit h. lib — 0, then M3 checks if 221-1 — Z2i and also X2i-i — X2i for i = 1, 2, . . .; on the contrary, if 5 = 1, 
then A/3 checks if Z2i = Z2i+i and X2i = X2i+i for i — 1,2, . . .. If this checking process is successful, then 
M3 enters a rejecting state; otherwise, it enters an accepting state. By combining those three machines, 
MulPrim is shown to belong to CFL,„(co-(CFLj^"^^^)), which equals E^fl^ 

Several basic relationships among the components of the CFL hierarchy are exhibited in the next lemma. 
More structural properties will be discussed later in Section 

Lemma 4-2 Let k be any integer satisfying k > 1. 

1. CFLt(SCFL) = CFLT(nCFL) and DCFLt(ECFL) ^ DCFLT(nCFL). 

2. SCFLunCFLcA^FLcECFLnnCFL. 

3. CFLH C DSPACE(0(n)). 

Proof. (1) This is a direct consequence of Lemma [3.121 since II^fl _ co-S^fl^ ^pj^g ^^^gg Turing 
DCFL-reduction is similar in essence. 

(2) Let A: > 1. Since A G DCFL^ holds for any oracle A, it holds that E^fl c DCFLt(ECFL) = aCFL^ 
Similarly, we obtain H^fl c DCFLtIH^fL) ^ DCFLt(SCFL) = a^fl^ ^j^gj.g ^j^g gj.g^ equality comes 
from (1). Moreover, since DCFL^ C CFL^ for all oracles A, it follows that A'j^H = DCFLt(S^p^) C 
CFLt(E^fl-) ^ E^FL^ Finally, using the fact that DCFL^ = co-DCFL;^ for any oracle A, we easily obtain 

ACFL ^ CO-ACFL c CO-ECFL ^ n^FL. 

(3) Note that CFL belongs to DSPACE(0(logn)) [gSJ, which is obviously included in DSPACE(0(n)). 
Moreover, the fact that A is in DSPACE(0(n)) leads to CFL^ C DSPACE(0(n)), implying the desired 
containment E^^^ C DSPACE(0(n)) for every index k G N^. □ 

Hereafter, we will explore fundamental properties of our new hierarchy. Our starting point is a closure 
property under substitution. A substitution on alphabet E is actually a function s : E — >■ 7'(E*) and this 
function is extended from the finite domain E into the infinite domain E* as follows. Given a string x = 
X1X2 ■ ■ ■ Xn, where each Xi is a symbol in E, we set s(x) to be the language {yiy2 •■•?/« | * G [n], j/i G s{xi)}. 
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Moreover, for any language A C E*, we define s{A) — [j^^A^i^)- Each language family S^^^ is closed 
under substitution in the following sense. 

Lemma 4-3 (substitution property) Let k £ and let s be any substitution on alphabet E satisfying 
s(cr) S ^k^^ /'^'^ each symbol a € T,. For any language A over E, if L is in S^^'", then s{L) is also in S^^'^. 

Proof. Since the basis case fc = 1 is well-known to hold (see, e.g., [7]), it suffices to assume that k > 2. 
For each symbol cr in E, take an oracle npda Alg- that recognizes s{a) relative to a certain language A^ 
in n^'i^. In addition, let M denote an oracle npda recognizing L relative to a certain oracle A G S^^]^. 
For simplicity, we assume that, when each oracle npda halts, it must empty its own stack (except for its 
bottom marker). Consider a new oracle npda N that behaves in the following manner. On input w, N 
nondeterministically splits w into (yi, y2, • ■ • , J/n) satisfying w — 2/12/2 ••• 2/n- Sequentially, at stage i S [n], 
N guesses a symbol, say, cr^ g E and simulates using a stack one or more steps of M while reading ai. In 
the end of this simulation stage, N places a special marker # on the top of the stack in order to share 
the same stack with M^.. Next, N simulates Ma-^ on the input ^yi$ using an empty portion of the stack, 
provided that # is regarded a new bottom marker for M^. . To make intact the saved data in the stack 
during this simulation of M^. , whenever Ma-^ tries to remove N instantly aborts the simulation and halts 
in a rejecting state. When Mg-. queries a string, say, Zi, N instead produces aiZi on its query tape, where 
(Ti indicates which oracle is targeted. If all npda's enter certain accepting states, then N accepts w; 
otherwise, it rejects w. 

Finally, an oracle B is defined as the set of strings of the form az for which z is in Arj, where G E. It 
can be observed that w is in s{L) if and only if N accepts w relative to B. Moreover, since all A„^s are in 
E^FL^ the set B is also in E^fl Therefore, s{L) belongs to CFL^ C E^f^, □ 

Once the closure property under substitution is established for S^fl^ other well-known closure properties 
(except for reversal and inverse homomorphism) follow directly. 

Lemma 4-4 Por each index k £ the family S^fl closed under the following operations: concatena- 
tion, union, reversal, Kleene closure, homomorphism, and inverse homomorphism. 

Proof. When fc = 1, EfFL CFL) satisfies all the listed closure properties (see, e.g., [THH]). Hereafter, 
we assume that k > 2. All the closure properties except for reversal and inverse homomorphism follow 
directly from Lemma 14.31 For completeness, however, we will include the proofs of those closure properties. 
The remaining closure properties require different arguments. 

[union] Given two languages Ai and A2 in S^fl^ define L ~ {1, 2} and take a substitution s satisfying 
s{i) = Ai for each i G {1,2}. Since s{L) = A\ y^ A^, Lemma [4.31 implies that s(L) belongs to S^fl 
[concatenation] For any languages A\,A2 G S^fl^ ^ _ {12} and define s(l) = A\ and s(2) — Ai- 

Since s(V) = \xy \ x G Ai,y G A2}, we apply Lemma 14.31 to s{L) and obtain the desired containment 
s{L) G ECFL. 

[Kleene closure] Given any language A in S^fl^ .^^g dggng g[\^ — ^ ajid x — {1}*, which obviously imply 
s{L) = A*. Now, we apply Lemma l473l and then obtain the desired membership s{L) G S^fl^ 
[homomorphism] This is trivial since a homomorphism is a special case of a substitution, 
[inverse homomorphism] Let E and F be two alphabets and take any language A in E^fl Qygj. p ^j^^j g^j^y 
homomorphism h from E to F*. Our goal is to show that h~^{A) is in S^fl^ j^q^ ]\,f ]-,g g^j^ oracle npda that 
recognizes A relative to an oracle, say, B in S^fl Now, we will construct another oracle npda N. Given 
any input x — X1X2 ■ • ■ Xn of length n, N applies h symbol by symbol. On reading Xi, N simulates several 
steps (including A- moves) of Af 's computation conducted during the scanning of h{xi). If N accepts x using 
B as an oracle, then the string h(x) = h(xi) ■ ■ ■ h{xn) is in A; otherwise, h{x) is not in A. Thus, h~'^{A) 
belongs to CFLf C E^fl. 

[reversal] This proof proceeds by induction on fc G N+. As noted before, it suffices to show the induction 
step fc > 2. Assume that A G S^fl ^j^^j jg|- ^ j^g oracle npda that recognizes A relative to a certain 
oracle B G S^fl^ \Ye aim at proving that the reversal A^ — {x^ \ x E A} also belongs to S^fl_ Nqw, we 
will construct the desired reversing npda M^. First, we conveniently set our new input instance is of the 
form $x^^. Intuitively, we need to "reverse" the entire computation of M from an accepting configuration 
with the head scanning $ to an initial configuration. To make the following description simple, we assume 
that M has only one accepting state and that M empties its stack before entering a halting state. 
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A major deviation from a standard proof of the case /c = 1 is the presence of a query tape and a process 
of querying a word and receiving its oracle answer. Now, let us consider a situation that M produces a query 
word y and receives its oracle answer b. Since we try to reverse the entire computation of M, conceptuahy, 
we need to design Mji to (i) receive the oracle answer b from an oracle and then (ii) produce the reversed 
word on the query tape. However, we cannot know the oracle answer before actually making a query. To 
avoid this pitfall, we force to guess b and start producing j/^. After finishing y^, we force Mr to make 
an actual query. If its actual oracle answer equals b, then Mn continues the simulation of M; otherwise, 
enters a rejecting state. To make this strategy work, we also need the reversed oracle in place of B. By 
the induction hypothesis, the set B^ is in Sj?':]^. By formalizing the above argument, can be shown to 
be recognized by Mji relative to B^. □ 

In Example 13. 1[ we have seen that the two languages Dup2 and Dup^ are in Sg^'^. Since they are 
not context-free, these examples actually prove that Ef^'^ ^ ^2^'". Since co-CFL ^ CFL/n [20, and 
co-CFL C £2^'^, we obtain a slightly improved separation as shown in Proposition 14.51 

Proposition 4.5 E^''^ ^ CFL/n. 

Let us recall the language family BHCFL, the Boolean hierarchy over CFL. Here, we will show that the 
second level of the CFL hierarchy contains BHCFL. 

Proposition 4.6 BHCFL C S^fl p U§^^ . 

Proof. Obviously, CFLi C Eg holds. It is therefore enough to show that CFL?; C E2 for every index 
k > 2. For this purpose, we will present a simple characterization of the fcth level of the Boolean hierarchy 
over CFL, despite the fact that CFL A CFL ^ CFL. 

Claim 8 For every index k>\, CFL2fc = ^ ^e[k] CFL2 and CFL2fc+i ^ (V»e[fc] CFL2) V CFL. 

Proof. In [H Claim 4], it is shown that BCFL2fc = BCFL2fc-2 V BCFL2 (and thus BCFL2fc = 
Vie[fe] BCFL2 follows) for the family BCFL of bounded context-free languages. Essentially the same proof 
works to show that CFL2fc = Vie[fc] BCFL2. Moreover, since CFL2fc+i = CFL2A; V CFL by the definition, we 
obtain CFL2fc+i = (V,e[fc] CFL2) V CFL. □ 

Now, we want to show that CFL2fc, CFL2fc+i C Ej^'^ for all indices fc > 1. We proceed by induction on 
A: > 1. The case of fc = 2 will be shown as follows. 

Claim 9 CFL2 C E^FL^ 

Proof. Let L be any language in CFL2 and take two context-free languages A and B satisfying L = A(^B. 
Let M be an appropriate npda recognizing A. Consider the following procedure: on input x, copy x to 
the query tape and, at the same time, run M on x. When M enters an accepting state along a certain 
computation path, make a query x to B. This demonstrates that L is in CFL^, which is included in 
CFL™-^PL C CFL5?-c^L = ECFl_ □ 

Assuming fc > 2, let us consider the language family CFLg/c- Claim [5] implies that CFL2fe = Vieffc] CFL2. 
Since CFL2 C E^^^^ Claim [3 we obtain CFL2fe C Vje[fe] '^2^'^- As is shown in Lemma SH E^^l 
closed under union and this fact implies that CFLgfe C E2^'^. Next, we consider CFL2fc+i. Since CFLgfc+i — 
CFLzfeVCFL by the definition, the above argument implies that CFL2fc+i C E^^^VCFL. Since CFL C E^'^l 
and the closure property of EP'" under union, it follows that CFLgfc+i C E^^^ V E^^^ = ^2^^- As a 
consequence, we conclude that CFL2fc, CFLgfc+i C E^^^. Therefore, BHCFL C E^^^ holds. 

Furthermore, we will prove that BHCFL C H^^^. It is possible to prove by induction on fc G that 
co-CFLfc C CFLfc+i. From this inclusion, we obtain co-BHCFL C BHCFL. By symmetry, BHCFL C 
co-BHCFL holds. Thus, we conclude that BHCFL = co-BHCFL. Therefore, the earlier assertion BHCFL C 
Y.f^ implies BHCFL C H^''^ as well. □ 

Let us turn our attention to CFL(a;). A direct estimation of each language family CFL(fc) shows that 
CFL(a;) is included in BHCFL. 

Proposition 4.7 CFL(cj) C BHCFL (thus, CFL(w) C E^^^ p jjCfl 
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Proof. A key to the proof of the first part of this proposition is the foUowing claim. 



Claim 10 For every index k > 1, CFL(fc) C CFL2fc+i holds. 

Proof. We will proceed by induction on fc > 1. When k ~ 1, the claim is obviously true since CFL(l) = 
CFLi C CFL3. For the induction step, assume that k >2. The induction hypothesis implies that CFL(fc ~ 
1) C CFLafe-i. Since CFL(fc) = CFL(/fc - 1) A CFL, we obtain CFL(fc) C CFLzfe-i A CFL. In contrast, it 
follows by the definition that CFL2fc+i = CFL2/C V CFL = (CFL2fc-i A co-CFL) V CFL. The last term equals 
(CFL2fe-i V CFL) A (CFL V co-CFL). Clearly, this language family includes CFL2fc-i A CFL as a subclass. 
Therefore, we conclude that CFL(fc) C CFL2fe+i. □ 

By Claim [ini it follows that CFL(a;) = UfceN+ CFL(/c) C UfceN+ CFLsfe+i C BHCFL. The second part 
of the proposition follows from Proposition 14.61 □ 

Let us argue that the language family CFL^^'^'-'^^ is located within the third level of the CFL hierarchy. 

Proposition 4.8 CFL^^^(") C S^fl 

Proof. Proposition EJ] implies that CFL^^^'") is included in CFL^'^^^ . By Theorem SH it follows 
that CFL™^^^ is included in CFUn{Il^^^), which is obviously a subclass of CFLt{II^^^) = Sp^^ □ 



4.2 Structural Properties of the CFL Hierarchy 

After showing fundamental properties of languages in the CFL hierarchy in Section 14.11 we will further 
explore structural properties that characterize the CFL hierarchy. Moreover, we will present three alternative 
characterizations (Theorem 14.111 and Proposition I4.15P of the hierarchy. 

Let us consider a situation in which Boolean operations are applied to languages in the CFL hierarchy. 

Lemma 4.9 1. E^fl ^ e^fl ^ ^cfl ^CYi. ^ ^cfl ^ ^cfl ^ > 

2. ECFL a nCFL c ECFL p nCFL arid ECFl V nCFL c ECFL p H^FL for anyk>l. 

Proof. In what follows, we are focused only on the E^^" case since the H^'" case is symmetric. 

(1) When k — 1, since CFL is closed under union, the statement follows immediately. Assume that k > 2 
and let L be any language in E^^" V E^^". Now, we take two oracle npda's Mo,Mi and two languages 
A,B & satisfying that L = L{Mo, A) U L{Mi,B). Our goal is to show that L G E^^. Let us consider 
another oracle npda M that behaves as follows. On input x, M guesses a bit b, writes it down on a query 
tape, and simulates Mt on x. Thus, when Mb halts with a query word yb produced on its query tape, N 
does the same with byb- Let us define C as the union {Oy \ y G A} U {ly \ y G B}. We will show that C 
is in n^^. To see this fact, consider the complement C. Note that C — {Oy \ y & A} L) {ly \ y G B}. 
Because A,Bg ^k-^^ by induction hypothesis, C belongs to 'S'j^Yi- Since L — L{N,C) holds, we conclude 
that L G CFLg C E^fl. 

(2) Assuming k > 2, let L be any language in Ej?^'" AII^'" and take an oracle npda M and two languages 

we define a new oracle npda N to simulate M 
on input x and produce an encoding [ |] of a computation of M on x with query word y. Next, let us define 
C as the set {[|] | x G A,y G B}, which belongs to by (1) using a fact that H^}^ C H^^. Since 

L — L{N , C), L is in CFL^, which is a subclass of E^_^J' . In a similar fashion, we can prove that L G H^jJ^J" . 
□ 

What is missing in the list of the above lemma is two language families E^^^ A E^^" and n^^" V H^^^. 
As we have seen, it holds that CFL A CFL = CFL(2) ^ CFL. Therefore, the equality E^fl /y ^^fl = j]Cfl 
does not hold in the first level (i.e., fc = 1). Surprisingly, it is possible to prove that this equality actually 
holds for any level more than 1. 

Proposition 4. 10 E^^ ^ j^cfl ^ j^cfl jjCfl y jjCfl ^ jjCfl i^^^i^ > 3. 

This proposition is not quite trivial and its proof requires two new characterizations of E^^" in terms of 
many-one reducibilities. Note that these characterizations are a natural extension of Claim [2] and, for our 
purpose, we introduce two many-one hierarchies. The many-one CFL hierarchy consists of language families 
S^fe^ and g f^) ^ foUo^3^ 5.CFL ^ nCFjL ^ DCFL, E^fl ^ CFL, H^^ ^ .^.^^cfl^ 
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and '^'i^k+i ~ ^^^•mi^'i^fk) ^'^^ k > 1, where the subscript "m" stands for "many-one." A relativized 



many-one NFA hierarchy, which was essentially formulated in [14) . is defined as follows relative to oracle A: 
2^m,o = DFA„, E„ 1 = NFA^, li^ ^^ = co-S^ , and = NFA^(n^_^ ) for every mdex 

A: > 1. Given a language family C, the notation E^^^'^ (or T^^^{C)) denotes the union IJ^^^ ^^,t'^ 



Theorem 4. 11 S^fl ^ ^cfl ^ Y.^^{DYCK) for every index k>l. 

Since the proof of Theorem 14.111 is involved, prior to the proof, we will demonstrate how to prove 
Proposition 14. 101 using the theorem. 

Proof of Proposition [mni In what follows, it suffices to prove that E^^^ A E^^^ = ^k^^, since 11^^^ V 
^k^^ = ^k^^ obtained by symmetry. First, take any language L in E^^^" A E^^^" and assume that L — 
Li n L2 for two languages Li, L2 S E^^^. Theorem [ilTl implies that Li and L2 are both in T,^^{DYCK). 
Now, we choose oracle npda's Mi and M2 that respectively recognize Li and L2 relative to oracles Ai and 
A2, where ^1,^26 U^^_^{DYCK). Let us consider a new npda TV that works in the following fashion. 
In scanning each input symbol, say, a, N simulates in parallel one or more steps of Mi and M2 using 
two sets of inner states for Mi and M2. Such a parallel simulation of two machines is possible because 
Ml and M2 use no stacks. Moreover, whenever Mi (resp., M2) tries to write a symbol, N writes it on 
the upper (resp., lower) track of its query tape. To write two query strings j/i and y2 of Mi and M2, 
respectively, onto A^'s single query tape, we actually write their t]-extensions. Now, let [ |] denote a query 
string produced by A^ so that [ V] encodes computations of Mi and M2. A new oracle B is finally set 
to be {[f] \ y £ Ai,z e A2}. Since Il^l^ = Il^^_^{DYCK), Lemma SHJl) ensures that B is also in 

tNFA 



Il^-^^_^{DYCK). By the above definitions, it holds that A^ m-reduces L to B. Therefore, it immediately 
follows that L e NFA^, C Y.fJ^{DYCK) = E^l, □ 

The first step toward the proof of Theorem 14.111 is to prove a key lemma given below. 

Lemma 4.12 For every index k>l, it holds that E^|"^' C CFL„(E^^l A H^^^) C NFA„(E^''^ A W^^^). 

Proof. The proof of the lemma proceeds by induction on fc G N+. Notice that the base case fc = 1 has 
been already proven as Proposition 13.131 Therefore, in what follows, we aim at the induction step of fc > 2 
by proving the following two inclusions: (1) E^fl c CFL,„(ECFL jjCFL) (2) CFL™(ECFL jjCFL) ^ 

NFA„(ECFl AnCFL). 

(1) Let us recall the proof of Proposition I3.13[ in particular, the proof of the following inclusion: 
CFL^^'^ C CFLm(CFL A co-CFL). We note that this proof is relativizable (that is, it works when we 
append an oracle to underlying npda's). To be more precise, essentially the same proof proves that 
CFLT(CFLy) C CFLm(CFL^ A co-CFL^) for any oracle A. If we choose an arbitrary language in E^fl 
A, then we conclude that E^fl c CFL„(ECFL n^FL). 

(2) By setting C = E^^^^^ A n^^^^^ in Lemma [331 we obtain CFL™(E^''l ^ jj^fL) c NFA,„(DCFL A C). 
Note that DCFL A (E^fl /\ aB^fL) = ^CFl ^ (pcFL A B^fL). Since DCFL C B^^fl^ it instantly follows 
that DCFL A B^fl ^ B^fl n^FL ^ B^fl Lemma ITW l). In summary, we obtain the desired inclusion 
NFA„(DCFL A C) C NFA„(ECFL /\ B^fL). " □ 

The second step is to establish the following inclusion relationship between two language families NFAm(Ej^FL^ 
B^L) and CFL™(BC^L). 

Lemma 4.13 For any two indices fc > 1 and e>k-l, it holds that NFA™(E^fl jj^fl^ c CFL™(B^fl^_ 

Proof. Let L be any language in NFAj^, where A is a certain language in E^fl /\n^FL jnjq^^ .^^e express 

A as yli n A2 using appropriate languages Ai G CFL^^ and A2 G B^^^, where Bi G B^^^_j. Our goal is 
to construct an oracle npda A^ and an oracle C for L. The desired machine A^ takes input x and simulates 
M on X. When M tries to write a symbol, say, cr, A^ writes a on the upper track of its query tape and also 
simulates one or more steps of Mi's computation during the scanning of a using a stack. When Mi writes 
a symbol, A^ uses the lower track to keep the symbol. Finally, A^ produces a query string of the form [ |] , 
where y is a query word of M and z is a query word of Mi. Next, we define C to be {[ |] | y G A2, 2: G i?i}. 
This language C obviously belongs to B^fl ^jjCFL^^^ which equals B^fl Lemma ITW l). since e > fc — 1. 
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Therefore, L is in CFL^ C CFL™(nCFL). □ 
Finally, we are ready to give the proof of Theorem 14.111 



Proof of Theorem 14.111 The proof of the theorem proceeds by induction on fc > 1. Since Lemma 
handles the base case fc = 1, it is sufficient to assume that fc > 2. First, we will show the second equality 
given in the theorem. 

Claim 11 For any index fc > 1, Y^J; = Y^^^{DYCK) holds. 

Proof. If fc = 1, then the claim is exactly the same as Claim[21 In the case of fc > 2, assume that L e CFLJ^ 
for a certain language A in n^^^_j^. A proof similar to that of Claim [5] proves the existence of a certain Dyck 
language D satisfying that CFL^ = NFA^, where B is of the form {[ |] \ y & D,z & A} and y and z are 
[^-extensions oiy and z, respectively. The definition places B into the language family T>GF\j /\I\^^_t^, which 
equals nCFL_^ because of fc > 2. By the induction hypothesis, nCFL_^ = I\^^^_^{DY C K) holds. It thus 
follows that NFA^ C ^YA^{n^^^_^{DY C K)) = i:^^^{DYCK), and thus we obtain L e Y^^^{DYCK). 



Next, we will establish the first equality in the theorem. Clearly, S^^^ Q ^fc holds since cfl;^ C CFL^ 
for any oracle A. Now, we target the other inclusion. By Lemma [4. 121 it follows that S^^^ C NFAm(S^^^ A 
n^FL). Since S^FL ^ eCFL_^^ ^Yyi&in S^fl g NFA„{I]CFL_^ nCFL_ Lemma OS] further implies 
that NFA^(E^L_i A n^L_^) ^ CFL„(n^L_i) ^ j.cfl_ conclusion,' S^fl c E^fl holds. □ 

An upward collapse property holds for the CFL hierarchy except for the first level. Similar to the notation 
CFLe expressing the eth level of the Boolean hierarchy over CFL, a new notation E^ g'" is introduced to 
denote the eth level of the Boolean hierarchy over E^fl^ Additionally, we set BHE^fl ^ Uggpj+ E^fl^ 
Notice that, when fc = 1, BHEfFL coincides with BHCFL. 

Lemma 4-14 (upward collapse properties) Let k be any integer at least 2. 

1. E^FL = s^FL and only i/CFLH = E^fl. 

2. ECFl ^ nCFL and only i/BHE^fl = ^cfl^ 

3. ECFl = nCFL .rnpUes E^fl ^ sCFL_ 

Proof. (1) It is obvious that CFLH = E^fl implies E^fl ^ ^^JL Nq^^ assume that E^fl ^ y^cfl^ 
By applying the complementation operation, we obtain II^fl _ n^FL_ Thus, it follows that E^fl _ 
CFLT(nCFL) ^ CFLT(nCFL) = sCFL_ Similarly, it is possible to prove by induction on e e N+ that 
S^FL = Y,^FL^ Therefore, CFLH = E^fl j^^j^g^ 

(2) Since H^fl g BHE^fl, obviously BHE^fl ^ 5]CFL implies E^fl ^ n^FL. Next, assume that 
E^FL ^ n^FL_ gy induction on e £ N+, we wish to prove that E^fl g E^fl_ rj,^^ ^^^^ where e = 1 is 
trivial. Firstly, let us consider the language family E^fl^^ for e > 1. Note that the induction hypothesis 
implies E^FL c E^FL. it thus holds that E^fl^^ = sCfl y sCFL g j^cfl y j^cfl ^ j^cfl Lemma ICTn. 
Secondly, we consider the family ECFl^2. It holds that E^fl^^ = ^k^2^+i A H^fl c E^fl /\ n^FL. Since 
n^FL = Y.CFL^ qI^^j^j^ ECFl^2 ^ E^FL jjCFL ^ jjCFL A H^FL = n^^FL by Lemma ICTl^. The last term 
obviously equals E^fl fj-om our assumption. Overall, we conclude that BHE^fl E^fl^ 

(3) Assume that E^fl ^ jjCFL ^^^^ f^^^g ^^j. attention to E^fl_ since Theorem KTT\ implies E^fl ^ 
E^FL^ our assumption is equivalent to E^fl ^ n^FL^ gy Lemma liTia it follows that E^fl g NFA„(E^fl /y 
n^fc ) = NFA,„(ECFL a eCFL)^ ^hich is included in NFA™(ECFL) by Lemma|4li;i). Since E^fl c Ep/L ^ 
U^%,, Lemmamih^plies that NFA„,(E^L) g NFA„(ECF^L'^nCFL_,) c CFL„,(n^L_i) = E^^ which 
equals E^fl by Theorem 14. 1 II again. □ 

From Lemma 14.141 if the Boolean hierarchy over E^fl collapses to E^fl^ then the CFL hierarchy 
collapses. It is not clear, however, that a much weaker assumption like E^fl _ E^fl^ suffices to draw the 
collapse of the CFL hierarchy (for instance, E^fl _ 1]CFL-|_ 

We note that Theorem 14.111 also gives a logical characterization of E^fl ^ -^g define a function Ext as 
Ext(x) = X for any [^-extension x of string x. 
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Proposition 4^.15 Let fc > 1. For any language L G over alphabet S, there exists another language 

A G DCFL and a linear polynomial p with p{n) > n for all n £ N that satisfy the following equivalence 
relation: for any number n G N and any string x G S", 

x^L if and only if 3x{\x\ < p(n))3yi{\yi\ < p{n))\/y2{\v2\ < p{n)) 

■■■Qkyk{\yk\ <p{n)) [x = Ext{x) A [x,yi,y2,-.-,ykV ^ A], 

where Qk is 3 if k is odd and is ^ if k is even. Moreover, x is a [^-extension of x. 

Let us recall from Section 13.21 the notion of "encoding of a computation path" of a nondeterniinistic 
computation. 

Proof of Proposition 14.151 This proof proceeds by induction on k G N+. First, we will target the 
case of fc = 1 and we begin with the assumption that L G Sf^'" {— CFL). Moreover, we assume that 
L is recognized by a certain npda, say, M and let p be a linear polynomial that bounds the running 
time of M . Consider the following new language A. This language A is defined as the collection of all 
stings of the form [ that encodes an accepting computation path of M on input x. It is not difficult 
to verify that A G DCFL. Moreover, the definition of A indicates that, for every string x, a; is in L if 
and only if there exist a [^-extension x oi x and [ ^^J is in A. The latter condition is logically equivalent to 
3i(|i| <p(n))32/i(|yi| < p{n))[x = Ext{S:) € A]. 

For the induction step fc > 2, let us assume that L G S^^'". Theorem 14.111 implies that L is many- 
one NFA-reducible to a certain oracle B in Il^f^_^{DYCK) n^^^) via an oracle nfa M. Note that 
the running time of M is upper-bounded by a certain linear polynomial, say, p. Since B is in S^^^, our 
induction hypothesis ensures that there are a linear polynomial q and a language C in DCFL such that, 
for any string zi, zi is in B if and only if Bz^^dzJ^j < q{n'))3u2{\u2\ < q_{n')) ■ ■ ■ Q'f^Uk{\uk\ < q_{n')) [[zi = 
Ext{z[) A [z[, U2, ■ . ■ , Mfc]"^ G C'], where Q'j, is 3 if fc is even and is V if fc is odd, and n' — \zi\. In a similar 
way as in the base case of fc = 1, we can define a language D in DCFL that is composed of strings [x, ui, zi]'^ 
that encodes an accepting computation path of M with input x and query word zi. Note that, for any 
given pair (i, ui), there is at most one string zi such that [x,ui, zi]'^ G D. From such a unique zi, a string 
zi = Ext{zi) is also uniquely determined. Note that x is in L if and only if there exist tj-extensions x of 
input X and z[ of query word zi and ui is an accepting computation path such that [x, ui, zi]"^ G I? Azi G i?. 

To complete the proof, we want to combine two strings [x, ui, zi]"^ and [z[, U2, ■ ■ ■ , Uk]'^ satisfying Ext{zi) = 
Ext(z[) into a single string by applying a technique of inserting [\ so that a single tape head can read off 
all information from the string at once. For convenience, we introduce three languages. Let D' = {[-jj \ 
3ui,Si[Ext{wi) = [IJ] A G D]}, C = {[W2,y3,...,ykr \ 35[,y2, . . . ,yk[Ext{w2) = [ |] A 

(Ata Ext{y,) = u,) A [ij, U2, • • • , Ukf G C]}, and E - {[f/] | 3ui,zi [Ext{wi = [f,]AExt{Si) = Ext{S[)]}. 
Finally, we define a language G = {[x,yi,y2, ■ ■ ■ ,ykV I [yj ^ -D' A [|^] G E A[yi,y2, ■ ■ ■ ,ykY ^ C"}. It is not 
difficult to show that G is in DCFL. Now, let r{n) — q{p{n)) for all n G N. With this language G, it follows 
that, for any string X, X is in L if and only if3i(|x| < r{n))3yi{\yi\ < p{n))\/y2{\y2\ < r{n)) ...Qkyk{\yk\ < 
r{n)) [x — Ext{x) A [x, yi, y2, ■ ■ ■ , yk]'^ S G]. Therefore, we have completed the induction step. □ 

4.3 BPCFL and a Relativized CFL Hierarchy 

Let us consider a probabilistic analogue of CFL. The bounded-error probabilistic language family BPCFL 
consists of all languages that are recognized by ppda's whose error probability is bounded from above by 
an absolute constant e G [0,1/2). This family is naturally contained in the unbounded- error probabilistic 
language family PCFL. Hromkovic and Schnitger [8j studied properties of BPCFL and showed that BPCFL 
and CFL are incomparable; more accurately, BPCFL ^ CFL and CFL ^ BPCFL. It is possible to strengthen 
the last separation using advice in the following way. 

Proposition 4. 16 BPCFL ^ CFL/71. 

Proof. Let us consider the example language Equals that is composed of all strings w over the alphabet 
Sg = {ai, 02, . . . , ag, #} such that each symbol except # appears in w the same number of times. It is shown 
in that Equals does not belong to CFL/n. To complete this proof, it is enough to show that Equals 
actually falls into BPCFL. 
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We set = 5 and consider the following probabilistic procedure. Let w be any input and define 
c^i — #ai(w) for every i G [6]. In the case where all a^'s are at most A^, we deterministically determine 
whether w £ Equals without using any stack. Hereafter, we consider the case where ai > N for all z G [6]. 
We randomly pick up two numbers x and y from [N]. We scan w from left to right. Whenever we scan oi 
(resp., 02 and 03), we push 1 (resp., 1^ and 1^) into the stack. On the contrary, when we scan 04 (resp., 05 
and og), we pop 1 (resp., 1^ and 1^) from the stack. If the stack becomes empty during the pop-ups, then we 
instead push a special symbol "—1" to indicate that there is a deficit in the stack content. If this happens, 
when we push I's, we actually pops the same number of — I's. After reading up w, if the stack becomes empty, 
then we accept the input; otherwise, we reject it. Note that there is ^ = | (ai — a^) + x{a.2 — 0^5) + 2/(a3 — ag) 
symbols in the stack. 

If w is in Equals, then £ = obviously holds for any choice of {x, y) G [A^] x [A^]. Conversely, we assume 
that w ^ Equals smd we will later argue that the error probability e (i.e., the probability of obtaining £ = 0) 
is at most 1/3. This clearly places Equals in BPCFL. 

Let us assume that w ^ Equals and I — Q. For any two pairs (3:1,2/1), {x2,y2) G [N] x [A^] that force £ 
to be zero, we obtain (ai — 04) + Xi{a2 — a^) + yi{a^ — as) — for any index i e {1, 2}. From these two 
equations, it follows that (*) (xi — X2){a2 — as) — {1/2 — yi)(a3 — ae)- 

(1) Consider the case where 02 — as but as ^ as- By (*), we conclude that yi = 1/2; that is, there is a 
unique solution y for the equation £ — 0. Hence, the total number of {x,y) that forces ^ = is exactly A^, 
and thus e equals 1/A^, which is smaller than 1/3. The case where a2 7^ as and as — as is similar. 

(2) Consider the case where a2 7^ as and as ^ as- There are two cases to consider separately. 

(a) If a2 — as and as — ag are relatively prime, then we conclude that x = x' and y = y' from (*). This 
indicates that there is a unique solution pair (z, y) for the equation £ = 0- Thus, the error probability e is 
1/A^^, implying e < 1/3. 

(b) If a2 — as = /3(a3 — as) holds for a certain non-zero integer /3, then it follows that x — x' = (3{y' — y). 
Since jx — x'j, \y' — y\ < 4, there are at most 12 cases for {x, y, x' , y') with x > x' that satisfy x — x' = /3{y' — y) 
for a certain non-zero integer /3. Hence, e is at most 12/25, which is obviously smaller than 1/3. 

(3) The case where ai 7^ a4, a2 = as, and as = ag never occurs because £ is always non-zero. □ 

It is not clear whether BPCFL is located inside the CFL hierarchy, because a standard argument used 
to prove that BPP C E2 H ^2 requires an amplification property but a ppda cannot, in general, amplify its 
success probability [8J. 

Since a relationship between BPCFL and S2 nH2 is not clear, we may resort to an oracle separation 
between those language families. For this purpose, we will introduce a relativization of the target language 
families. First, we introduce a relativization of BPCFL. Similar to CFL^, BPCFL^ is defined simply 
by providing underlying ppda's with extra query tapes. Second, we define a relativized CFL hierarchy. 
Relative to this A, a relativized CFL hierarchy {A^^'^'"^, E^^'^'"^, H^^'^'"^ | fc £ N} consists of the following 

r -y A CFL. A A CFL. A .r^CFL.A t-tCFL.A t-i/^T7T ^ ^CFL,A r^Tj^j A T-rCFL,A ^CFL.A 

iamihes: Aq = = Lg =11^'= DCI'Lj., 1^^ = CI'Lj., IIj, = co-L^ j, , 

= DCFLT(Efc^^'^), and 11^+^^ = CFLT(n^^^-*) for all indices /fc > 1. As this definition hints, 
any oracle-dependent language in y]'^^^'^ can be recognized by a chain of k T-reduction machines whose 
last machine makes queries directly to A. As for later reference, we refer to this chain of k machines as a 
defining machine set for L^. 

Proposition .4. J 7 There exists a recursive oracle A such that BPCFL^^ ^ J]^^^''^- 

For the proof of Proposition 14.171 we need to consider certain non- uniform families of levelable Boolean 
circuits, where a circuit is levelable if (i) all nodes are partitioned into levels, (ii) edges exist only between 
adjacent levels, (iii) the first level of a circuit has the output node, (iv) the gates at the same level are of 
the same type, (v) two gates at two adjacent levels are of the same type, and (vi) all input nodes (labeled 
by literals) are at the same level (see, e.g., [B]). The notation CIRk{n,m) denotes the collection of all 
levelable circuits C that satisfy the following conditions: (1) C has depth k (i.e., k levels of gates), (2) the 
top gate (i.e., the root node) of C is OR, (3) C has alternating levels of OR and AND, (4) the bottom fan-in 
(i.e., the maximum fan-in of any bottom gate) of C is at most m, (5) the fan-in of any gate except for the 
bottom gates is at most n, and (6) there are at most n input variables. 

Lemma 4^.18 Let L^ be any oracle- dependent language over alphabet S in S^^^''^, where A is any oracle 
over alphabet Q. Let {Mi, M2, ■ - ■ , Mk) be a defining machine set for L^ . Let a and c be two positive constants 
such that the running time of each Mi is bounded from above by c\x\ and any query word y produced by Mk 
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has length at most a\x\, where x is an input string to . For every length n G and every input x G S", 
there exist a Boolean circuit C in C/i?fe+i (2"", ck) such that (1) all variables Xi,X2, ■ ■ ■ , x^^^^ of C are strings 
(and their negations) included in B-**" and (2) for any oracle A, it holds that x is in L"^ if and only if C 
outputs 1 on inputs (x'^{xi),x^{x2), ■ ■ ■ ,X"^ixe(n)))- 

Proof. We will prove this lemma by induction on fc > 1. We begin with the base case k — 1. Let 
be any oracle-dependent language in CFL^, where A is any oracle. Let M,a,c satisfy the premise of the 
lemma. Fix n and x G E" . By an argument similar to the proof of Proposition 13.131 we can modify M so 
that, before starting writing the zth query word it must guess its oracle answer 6; and produces feij/it] on 
a query tape and, instead of making an actual query, it assumes that A returns bi . After this modification, 
a string produced on a query tape must be of the form biyi^b2y2^ ■ • • tj^fj/^t]. Let Vx be composed of all such 
query strings produced along accepting computation paths. Note that || Vx ||< 2^°" since M halts within 
time an. 

Next, we will define a circuit C, which is an OR of ANDs, as follows. The top OR gate has edges labeled 
by strings in Vx- For each y G T4, an associated subcircuit Dy, consisting of an AND gate, has input nodes 

labeled by literals of the from y'^i^\ y^2^\ ■ ■ ■ , y'l'), where y] — yi and — yl. Let xi,X2, • . • , Xm(n) be all 
distinct variables (of the positive form) appearing in C . It is not difficult to verify that, for any oracle A, 
X G if and only if C(x^(a;i), x'^{x2), . • . , x'^ixmin))) = 1- 

Let us consider the induction step k > 2. Assume that is in CFLt{B^) for a certain oracle B"^ G 
n^Z"! '"^- Let M be a T-reduction machine reducing to B^. By a similar argument as in the base 
case, we can modify so that it generates query words of the from biyi^ - ■ ■^be,yi^ without making actual 
queries. We set Vx to be the collection of all such strings produced along accepting computation paths. 
Since B^ G Sj, ' , by the induction hypothesis, for each string y (zVx, there exists a circuit Dy satisfying 
the lemma. Instead of Dy, we consider its dual circuit Dy. Here, we define C to be an OR of all -D^'s for 
any y G 14. A similar reasoning as in the base case shows that, for any oracle A, x is in if and only if 
C{x^{xi), x'^(x2), . . . , X^(a;m(„))) = 1. □ 

Our example language is L"^ = {0" | |1 A n S" ||> 2""!}, where E = {0, 1}. To guarantee that G 
BPCFL;^, we wiU aim at constructing an oracle A satisfying that either ||AnE"||< 2"/3 or ||3nS"||< 2"/3 
for every length n G N'^ . This is done by recursively choosing a pair of reduction machines that define each 
language B^ in Yi'^^^'^ and by defining a large enough length n G N and a set An {= Ar\ E") such that 
0" G L-^V^ 0" G B^. 

Proof of Proposition B. 1 71 Let E = {0, 1} and consider an example language — {0" 1 1| AflE" ||> 2"^^}. 
To guarantee that G BPCFL^, we consider only oracles A that satisfy the condition (*) that either 
II An E" II < 2"/3 or II An E" ||< 2"/3 for every length 7i G N+. 

Next, we will construct an appropriate oracle A satisfying that ^ Ej^'"'^. For this purpose, we use 
Lemma 14.181 First, we enumerate all oracle-dependent languages in E^^^ and consider their corresponding 
depth-3 Boolean circuit families that satisfy all the conditions stated in Lemma [4. 181 

Recursively, we choose such a circuit family and define a large enough length n and a set A„ (= A n E"). 
Initially, we set no = and Ag = 0. Assume that, at stage i — 1, we have already defined n^-i and Ai_i. 
Let us consider stage i. Take the zth circuit family {C„}„gN and two constants a,c > given by Lemma 
14.181 First, we set = max{ni_i -I- 1, 2° -f 1, c' -I- 1}, where a' and c' are constants taken at stage i — 1. 
The choice of guarantees that A^ is not affected by the behaviors of the circuits considered at stage i — \. 

In the rest of the proof, we will examine two cases. 

(1) Consider the base case where the bottom fan-in is exactly 1. For each label y G {0, 1}°", let Q{y) be 
the set of all input variables that appear in subcircuits connected to the top OR gate by a wire labeled y. In 
particular, Q^(y) (resp., Q~{y)) consists of variables in Q{y) that appear in positive form (resp., negative 
form). Let us consider two cases. 

(a) Assume that there exists a string yo such that || Q^(yo) ||< 2"/3. In this case, we set A„ to be 
Q+(yo). It is obvious that 0" ^ and C„(x'^(a;i), . . . ,x^(a;2-0) = 1- 

(b) Assume that, for all y, || Q^(y) ||> 2""/3. Recursively, we will choose at most an/ log(3/2) + 1 strings. 
At the first step, let _Bo — {0, 1}°". Assume that has been defined. We will define Bi as follows. Choose 
lexicographically the smallest string w such that the set {y G -B^-i | w G Q^{y)} has the largest cardinality. 
Finally, we define Wi be this string w and set Bi — {y E | Wi ^ Q^{y)}. In what follows, we will show 
that ||B,||< (2/3) ||B.-i||. 
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Claim 12 \\B,\\< (2/3) 

Proof. Let d satisfy || |h d- \\ Bi^i \\. For each index i e [2""], let X, ^ {y \ Xi e Q+{y)}. Since 

II Q+{y) \\> 2'^'V3 for all y's, it holds that J2T=i II ^1 > (2'^"/3) II ||. Note that J^TSi II ^« IH 2"". 
Thus, we obtain d > 1/3. Since \\Bi |h||S,_i || - ||B, ||, it follows that ||Bj||< (2/3) ||B,_i ||. □ 

From the above claim, it follows that || Bi \\< (2/3)' || Bq ||= (2/3)'2°". Let io denote the minimal number 
such that II Bi ||= 0. Since i > an/ log(3/2) implies || Bi \\< 1, we conclude that io < an/ log(3/2) + 1. Now, 
we write W for the collection of all Wi's {I < i < io) defined in the above procedure. The desired An is 
defined to be (S"" -W)U {{Jy Q-{y)). 

(2) Second, we will consider the case where the bottom fan- in is more than 1. To handle this case, we 
will use a special form of the so-called switching lemma to reduce this case to the base case. 

A restriction is a map p from a set of n Boolean variables to {0, 1, *}. We define TZlf to be the collection 
of restrictions p on a domain of n variables that have exactly I unset variables and a q-fraction of the variables 
are set to be 1. For any circuit C, bf{C) denotes the bottom fan-in of C. 

Claim 13 1 Let C he a circuit of OR of ANDs with bottom fan-in at most r. Let n > 0, s > 0, £ = pn, 
andp< 1/7.' It holds that \\{p e U^^i \ 3D : AND of ORs[bf{D) > s]}||< {2pr/q^y ||7^^'«||. 

Consider any subcircuit D, an AND of ORs, attached to the top OR-gate. By setting q — 1/3 and r — c, 
we apply Claim[T2]to D. The probability that D is written as an OR of ANDs with bottom fan-in at most an 
is upper-bounded by 1 — (18pc)"". Moreover, the probability that all such subcircuits D are simultaneously 
written as circuits, each of which is an AND of ORs, is at most [1 — (ISpc)""]^"" > 1 — 2""(18pc)"" = 
1 - (36pc)''". If we choose p = l/72c, then the success probability is at least 1 - (36pc)"" > 1 - (1/2)°", 
which is larger than 1/2 for any integer n > 2 /a. Since every subcircuit D is written as an AND of ORs, 
the original circuit C can be written as an OR of ANDs with bottom fan-in at most an. Finally, we apply 
the base case to this new circuit. □ 

There also exists an obvious oracle for which BPCFL equals S2 ^'^ since the following equivalence holds. 

Proposition 4.19 BPCFL^^p^^'^ = j.cfl,pspace ^ pgpACE. 

Proof. It is obvious that BPCFLf C PSFACEf for every oracle B, where PSPACEf is a many-one 
relativization of PSPACE relative to B. Hence, it follows that BPCFL^^^"^^^ C PSPACE. Conversely, it 
holds that PSPACE C BPCFL^^P^^'^. The case of sCfl.pspace gj^^jij^j. □ 

We have just seen an oracle that supports the inclusion BPCFL C and another oracle that does 

not. As this example showcases, some relativization results are quite counterintuitive. Before closing this 
subsection, we will present another plausible example regarding a parity NFA language family ©NFA whose 
elements are languages of the form {x \ \\ ACCm{x) ||= 1 (mod 2)} for arbitrary nfa's M . In the unrelativized 
world, it is known that ©NFA C TC^ C PH; however, there exists another oracle that defies this fact. 

Lemma 4-20 There exists an oracle such that 0NFA^j ^ PH'^. 

Proof. Let us consider a special language L'^ = {0" | ©^.g^n X^{^) = 1 (mod 2)} relative to oracle A. 
It is easy to show that, for any oracle A, L^ is in 0NFA^j by guessing a string x in E" and querying it to 
A. Since it is shown in ^ that L^ ^ PH"^ for a random oracle A, we immediately obtain the desired oracle 
separation. □ 



5 A Close Relation to the Polynomial Hierarchy 

Through the last section, the CFL hierarchy has proven to be viable in classifying certain languages and it 
can be characterized by two natural ways, as shown in Theorem 14.111 Moreover, we known that the first 
two levels of the CFL hierarchy are different (namely, Ef^'^ 7^ Ej^'^); however, the separation of the rest of 
the hierarchy still remains unknown. In this section, we will discuss under what conditions the separation is 
possible. 

Given a language A, a language L is in L^ if there exists a logarithmic-space (or log-space) DTM M with 
an extra write-only query tape (other than a two-way read-only input tape and a two-way read/write work 
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tape) such that, for every string x, a; is in L if and only if M on the input x produces a string in A. RecaU that 
any tape head on a write-only tape moves only in one direction. For a language family C, we denote by the 
union (Jy^g^ ^m- Occasionally, we also write Lm(C) to mean L^. In particular, when C = CFL, the language 
family L^fl has been known as LOGCFL (LogCFL or LOG(CFL)) in the literature. Since = CFL, 

it follows from [TB] that Lm(Sf^'^) = SAC^. As for the language family PCFL, for instance, Macarie and 
Ogihara tl3j demonstrated that L^^^fl g rp,^! Concerning BPCFL, the containment L^p^^^i^ C h^{Y.f^) 
holds. This is shown as follows. From BPCFL C PCFL, it follows that L^p^fl ^ lP^fl ^ r^^i jjowever, 
by Claim m L„(SpL) ^ r^j^^g^ ^l^^^^ ^^Lat LJ^p^fl ^ tC^ C NP = L„(E^pl)_ 

It is obvious that Ci = C2 implies L^J = Lf^; however, the converse does not always hold. Here is a 
simple example. Although CFL(fc) ^ CFL holds for k > 2, the following equality holds. 

Lemma 5.1 L™'"^^"^ = L^fl ^ SAC^ 

Proof. As noted before, it is known that L^P'^ — SAC"'^. Therefore, our goal is set to prove that 
L^^^^''^ = L^FL for any index fc > 2. Note that L^fl = L™'"^'^^ C L™'"^^''^ for aU indices k e N+. The 
remaining task is to show that Lm^'"''^'' C LJ^P'". Let k > 2 and assume that A G L^j for a certain language 
B G CFL(fc). There are k languages Bi, B2, . . . , Bk £ CFL satisfying that B — Hieffc] ^i- Take any log-space 
oracle DTM M that recognizes A relative to B. 

We define a new reduction machine iVi so that, on input x, it outputs ytjyt] • ■ • tjy {k y's) if M on input x 
outputs y. Define C = {yi[\y2i\ ■ ■ ■ tl2/fc | Vi G [fc] [yt G Bi]}. Obviously, it holds that A^i(a;) G C iff M{x) G B. 
It thus suffices to show that C is in CFL. 

To show that C G CFL, let us consider an npda N2 that behaves as follows. On input w — yi^y2\\ ■ ■ ■ \]yk, 
N2 sequentially simulates Mi on input j/^, starting with i = 1. After each simulation of Mi{yi), we always 
clear the stack so that each simulation does not affect the next one. Moreover, as soon as Mi rejects y^, N2 
enters a rejecting state. It is obvious that C is recognized by iV2- ^ 

The CFL hierarchy turns out to be a useful tool because it is closely related to the polynomial hierarchy 
{AP,I]P,nP I fc G N}. Reinhardt [13] first established a close connection between his alternating hierarchy 
over CFL and the polynomial hierarchy. Similar to S^g'", the notation S^^ stands for the eth level of the 
Boolean hierarchies over E^. We want to demonstrate the following intimate relationship between E^PJ^g 
and EP ,. 

Theorem 5.2 For every index e,fc G N+, L,„(E^P^g) = EP^ holds. 

Proof. Fixing k arbitrarily, we will show the theorem by induction on e G Our starting point is the 
base case where e — 1. 

Claim 14 Lmi^k+i) ~ ^fc Isolds for every index k G 

Proof. We want to demonstrate separately that, for every index fc G (1) L,„(E^p^) C eP and (2) 
yP r T ryCFLN 

(1) To prove that L.,„(E^pJ') C EP, we start with the following useful relationship between ^^+1 and 
Claim 15 'S'^+i — ^fc holds for every index k G 

Proof. This claim will be proven by induction on fc > 1. A key to the following proof is the fact that 
CFL^ C NP^ holds for every oracle A. When fc = 1, it holds that E^p^ ^ CFL^p^ C NP^p^. Since 
CFL C P, we obtain NP^p^ C NpP = NP, yielding the desired containment E^^ C NP. When fc > 2, by 
the induction hypothesis, we assume that E^p^ C EP_;^. It therefore follows that E^pp = CFLT(n^PP) C 

NP(nCFP)cNP(nP_i) = sP. □ 



The inclusion Lm(E^PP) C Lm(EP) follows from Claim [T5] Hence, using the fact that Lm(EP) C EP, we 
conclude that L™(E^pP) C E);?. 

(2) Next, we will show that EP C L,„(E^pP). An underlying idea of the following argument comes from 
[14] . Our plan is to define a set of fc quantified Boolean formulas, denoted QBFk, which is slightly different 
from a standard BQFk, and to prove that (a) QBFk is log-space complete for Ej; and (b) QBFk indeed 
belongs to E^fl. Combining (a) and (b) implies that EP C L^^^" C L„(ECFL). 
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First, we will discuss the case where k is odd. The language QBFk must be of a specific form so 
that an input-tape head of an oracle npda can read through a given instance of QBFk from left to 
right without back-tracking. First, we prepare the following alphabet of distinct input symbols: = 
{3, V, A, t], -(-, — , 0, oi, 02, . . . , flfe}. A string belongs to QBFk exactly when (f> is of the following form: 
3a™^Va™^ • • • Q/ca™'°t]ci A C2 A • • • A c,„, where each rrii and m are in N+, Qk is 3, each a is a string 
Ci/iCi^g.-i ■ ■ ■ Ci^i in {-|-,— ,0}^' for a certain number £i satisfying £i > m — X]^=i"^ii and, moreover, the 
corresponding quantified Boolean formula 

[Ci A C2 A • • • A C„] 

is satisfied, where m' = m — nik, C'i is a formula of the form (Vjes+(t) ^j) V (Vjes_(i) — {j ^ M I 

j < TO,Ci,f,_j+i = -f}, and S-{i) = {j 6 [^] | j < rn,Ci^i,-j+i = 

When k is even, we define BQFk by exchanging the roles of A and V and by setting Qk = V in the 
above definition. As is shown in |15j . it is possible to demonstrate that QBFk is log-space many-one 
complete for E^; that is, every language in belongs to L^j-^^*". If QBFk is in E^jjJ'^, then we obtain 

C L^J^P- C L,„(ECFL), as requested. 

Therefore, what remains undone is to prove that QBFk is indeed in E^jjl^J' by constructing a chain of 
m-reduction machines for QBFk- As done in Section [4.3l we also call such a chain of m-reduction machines 
computing QBFk by a defining machine set for QBFk- Given an index i S [fc], we define a string (pi as 
ipi = Qia™"^ ■ ■ ■ Qka^''^ci A C2 A • ■ • A Cm- Now, let -01 denote 0i. The {i + l)st machine works as follows. 
Assume that an input string ^pi to the machine has the form ["1^^] • • • ["i^^^ ](t>i- While reading nii + l symbols 
of "QiO™'" until the next symbol Qi+i, the machine generates all strings Si — SiiSi2 • • • in {0, 1}™' and 
then produces corresponding strings [ • • • [ "^i/J^i+i on a query tape. Note that, at any moment, if the 
machine discovers that the input does not have a valid form, it immediately enters a rejecting state and 
halts. The last machine N works in the following manner, when ipk is given as an input. First, N stores the 
string [°]^^] • • • ["^-1 ] in its stack, guesses a binary string Sk of length trik, and stores [°'^^] also in the stack. 
Second, N guesses j, locates the block of Cj, and checks whether its corresponding Boolean formula Cj is 
satisfied by the assignment specified by S1S2 ■ ■ ■ Sk- This checking process can be easily done by comparing 
the two strings S1S2 • • • Sfc and Cj symbol by symbol as follows: if corresponds to Cij.-j+i, then N accepts 
the input exactly when (s^ = 1 A q^^.^j+i = +) or (s^ = A Q^^^-j+i = — ). The existence of a defining 
machine set for QBFk proves that QBFk indeed belongs to E^^]". □ 

Finally, let us consider the case where e > 2. A key to the proof of this case is the following simple fact. 
For convenience, we say that a language family C admits input redundancy if, for every language L in C, two 
languages L' = {x[\y \ x ^ L} and L" — {xi\y \ y & L} are both in C, provided that t] is a fresh symbol that 
never appears in x as well as y. 

Claim 16 If two language families Ci andC2 admit input redundancy, then L'^AL'^ C L^J^''^ and L^JVL^^ C 

T C1VC2 
ni 

Proof. We will show only the first assertion of the lemma, because the second assertion follows similarly. 
Take any language L and assume that L g L^^ ^^m^ for certain two languages Ai S Ci and A2 £ C2. there are 
two log-space m-reduction machines A/i and M2 such that, for every string a:, x is in L if and only if A//'(x) 
outputs yi and yi € Ai for any index i 6 {1,2}. Now, we want to define another machine M as follows. On 
input X, M first simulates Mi on x and produces yit] on its query tape. Moreover, M simulates M2 on x and 
produces y2 also on the query tape following the string yit). The language C — {yi^y2 \ yi G ^1,2/2 G A2} 
clearly belongs to Ci A €2- Thus, it is obviously that L is in C L^J^''^. □ 

Claim[iniimpliesthatL„(ECFL^^^J = L„(ECFl^^VECFL) ^ L„(ECFL^ JvL„(ECFL). Since L™(SCFL) = 
EP and Lm{^kli,2e) = ^k,2e the induction hypothesis, we obtain L™(ECFL_^^^^) ^ sP-^e V E^ = EP2e+i- 
Similarly, we conclude that ^-mi^k+i 2e+2) ^ ^fc'2e+2- '-' 

Unfortunately, the proof presented above does not apply to obtain, for instance, L,„(Ej?fl 
EF n simply because no many-one complete languages are known for EF n HF. This remains as a 
challenging question. 

An immediate consequence of Theorem 15.21 is given below. 
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Corollary 5.3 If the polynomial hierarchy is infinite, then so is the Boolean hierarchy over S^^'" at every 
level k > 2. In particular, if the polynomial hierarchy is infinite, then so is the CFL hierarchy. 

Proof. Let k >2. Theorem 15 . 2 1 implies that, if the Boolean hierarchy over is infinite, then the Boolean 
hierarchy over S^^]^ is also infinite. Earlier, Kadin [3] showed that, if the polynomial hierarchy is infinite, 
then the Boolean hierarchy over is infinite for every index fc > 1. By combining those statements, we 
instantly obtain the desired consequence. □ 

More specifically, the following separation holds for the fcth level and the (fc + l)st level of the CFL 
hierarchy. 

Corollary 5.4 Let k>2. //PH ^ S^, then S^fl ^ ^^fl. 

In terms of the inclusion relationship, as shown in Claim [T51 S^^l jg upper-bounded by E^. It is possible 
to show that this bound is tight under the assumption that A^^j^ is different from S^^-^. 



Corollary 5.5 For any index k >2, if A^^^ ^ ^fc+iJ then S^^l ^ 



Proof. We want to show the contrapositive of the corollary. We start with the assumption that S^^l q 
S^. From this inclusion, it follows that — Lm(S^). Since Lm(S^FL-) _ holds by Theorem 

15.21 we obtain C L„j(S^. On the contrary, it holds that Lm(S^) C P^fc = A^_^^. As a result, S^^^ is 

included in A^_^j^. Since A^_^j^ C T,^^-^ is obvious, A^_^j^ = follows immediately. □ 

Let us recaU that CFL(a;) C S^fl p U'^^^ by Proposition O and CFL(a;) C SAC^ C NC^ by Lemma 
Em If S^FL g j^Qjjg^ ^^^^ Claim Us] implies that NP = L™(S^fl-) ^ L™(NC^) = NC^ and we obtain 
NP = NC since NC^ C NP is obvious. In the end, we reach the following conclusion. 

Corollary 5.6 IfNP ^ NC^ then S^fl ^ ^(^2 _ 



We have focused our attention to the computational complexity of languages that are log-space many-one 
reducible to certain languages in S^fl_ j.gg|; qJ- ^j^jg gggtion, we will turn our attention to log-space 

truth-table reducible languages to S^fl_ jjere, we use the notation L^ (or Ltt{A)) to mean a family of 
languages that are log-space truth-table reducible to A. It is easy to show that a truth-table reduction can 
simulate Boolean operations that define each level of the Boolean hierarchy over CFL. Hence, the Boolean 
hierarchy BHCFL is "equivalent" to CFL under the log-space truth-table reducibility. 

Lemma 5.7 LBHCFL ^lC/l. 



Proof. We need to show the equality Lj^^'^* = L^fl f^j. every index fc > 1. Since CFL C CFL^, it holds 
that L^FL L^^'^*. Conversely, we will show that L^^'"'' C L^fl j^y induction on A: > 1. Since the base 



case A: = 1 is trivial, we hereafter assume that k>2. Let L be any language in L^^'"'". Moreover, let M be 
a log-space oracle DTM M and A be an oracle A in CFL^ such that M tt-reduces L to A. Now, consider 
the case where k is even. We will construct a new oracle npda N as follows. On input x, when M produces 
m query words yi, 2/2, ■ • ■ , 2/m, N produces 2m query words Oj/i, lyi,Qy2, ly2, ■ • • , Oj/„i, Ij/m- Assume that 
A = B n C for two appropriate languages B E CFLfe_i and C £ co-CFL. We define B' = {Oy \ y G B} and 
C' = {ly \ y ^ C} and we set A' to be B' U C", which is in CFLfc_i V CFL = CFLfc_i since CFLfc_i is closed 
under union with CFL. Note that yi is in A if and only if Oj/i ^ B' and It/i e C". We use this equivalence 
relation as a truth table to judge the membership of x to L. 

When k is odd, since A = BUC ioi certain languages B e CFLfc_i and C G CFL, it suffices to transform 
B and C to B' = {Oy \ y £ B} and C = {ly \ y E C}. The rest of the argument is similar to the previous 
case. □ 

Wagner [TO] introduced a convenient notation Q]^^i as an abbreviation of Pt(SF [0(log n)]) for each level 
k > 1, where the script "[O(logn)]" means that the total number of queries made in an entire computation 
tree on an input of size n to an oracle in is bounded from above by c log n + d for two absolute constants 
c,d>0. 

Theorem 5.8 For all levels k>l, Lttil^'^ll) = O^+i holds. 

Proof. Let k > 1. First, we will give a useful characterization of 0^_,_^ in terms of using two 
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different truth-table reductions. In a way similar to L^, the notation (or Ptt{A)) is introduced using 
polynomial-time DTMs instead of log-space DTMs. 

Claim 17 For every index k G it holds that = P4f(E^) = Lit(E^). 

Proof. It suffices to show that Ptti^l) C Ltt(SP) and 6^+^ = Ptt(E^) since Ltt(SP) C Pu{^l) is obvious. 
Note that the proof of P^^ C L^p by Buss and Hay g] relativizes; namely, Ptt(NP^) C Ltt(NP^) for any 
oracle A. Recall the language QBFk defined in the proof of Claim [T5] By choosing QBFk-i for A, we obtain 
Pu{T.l) = P„(NP«^^'=-i) C Ltt(NP'?^^^-i) = Ltt(SP). Moreover, the proof of P?^[°('°«"^l = P^p given 
in, e.g., [3] also relativizes. By a similar argument as above, it also follows that PT(EP[0(logn]) = Ptt(EP). 
□ 

Since we have earlier shown that S^pJ" C EP, it follows that Ltt(E^PJ') C Ltt(EP) = O^+i, where the 
last equality comes from Claim [T71 In what follows, we intend to argue that Q^^i C Ltt(E^PJ'). Assume 
that L e thus, L is in Lf4(EP) by Claim [T71 Since QBFk is log-space many-one complete for EP, 

we can replace EP with QBFk. Since EP C Lf^^^", it holds that L e Ltt(L^^^'=) = L^^^\ Since QBFk 
belongs to ^k+i the proof of Claim [iSl we conclude that L e Lft(E^P^). □ 
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